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Multifunctional Recombinant Phycobiliprotein-Based Fluorescent Constructs and 

Phycobilisome Display 

Inventors: Alexander N. Glazer and Yuping Cai 
Assignee: The Regents of the University of California 

The research carried out in the subject application was supported in part by grants 
from the Department of Energy (Grant No. RDE-FG-91ER61 125). The government may 
have rights in this invention. 

INTRODUCTION 

Field of the Invention 

The field of the invention is phycobiliprotein-assisted expression and folding of 
proteins and the production of function-added recombinant phycobiliproteins. 

Background of the Invention 

Many foreign proteins expressed in bacteria form insoluble aggregates called 
inclusion bodies. Recovery of the recombinant protein of interest requires dissolving the 
inclusion bodies under denaturing conditions, removing the denaturant to allow the 
recombinant protein to fold, and finally purifying the recombinant protein. This process is 
laborious and frequently gives low yields. Inclusion bodies are formed because the 
aggregation of the newly synthesized recombinant polypeptide is faster than its folding into 
the native structure. In one aspect, the invention provides Afunctional fusion constructs 
which are rapidly incorporated into a macromolecular structure such that the fusion proteins 
are separated from one another and unable to self-associate. . 

In a more particular aspect, the macromolecular structures are oligomeric 
phycobiliproteins - a family of structurally related photosynthetic accessory proteins naturally 
found, inter alia, in cyanobacteria, the chloroplasts of the Rhodophyta (red algae) and in those 
of the Cryptophyceae. When they carry covalently attached linear tetrapyrrole prosthetic 
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groups (bilins), these proteins can exhibit brilliant colors and intense fluorescence, making 
them valuable specific labeling reagents. Unfortunately, the diversity of such reagents has 
been restricted to naturally occurring phycobiliproteins and their target specificity generated 
by chemical conjugation, which generates heterogeneity (see, e.g. Siiman et aL, 1999, 
Bioconj Chem, 10, 1090-1 106). The use of phycobiliproteins in the invention also 
overcomes these prior art limitations and provides homogenous, specific labeling reagents. 

SUMMARY OF THE INVENTION 
We have found that phycobiliprotein subunits are tolerant to terminal extensions of 
different sizes. A phycobiliprotein subunit domain in a fusion protein is able to assemble with 
a cognate partner subunit to form heterodimers, and frequently further assemble into even 
higher-order aggregates, and into the light-harvesting antenna complexes, phycobilisomes. 
The non-phycobiliprotein part of the fusion protein (the displayed domain) is exposed on the 
surface of the oligomeric phycobiliproteins. We define this fusion protein expression system 
as "phycobilisome display". Newly synthesized fusion polypeptides are quickly separated 
from each other by virtue of the assembly of the phycobiliprotein domain (the carrier domain) 
into oligomers. The non-phycobiliprotein moiety (the displayed domain) of the fusion protein 
is displayed on the surface of oligomer, e.g. the rods of the phycobilisome, and is able to fold 
into functional proteins while sequestered from interaction with other unfolded polypeptides. 
Phycobilisome display can therefore be used as an alternative folding environment for 
difficult-to-fold proteins, especially those that have been found difficult to fold in other 
organisms. The displayed protein can be used as a fusion construct bearing a fluorescent 
phycobiliprotein tag, or be separated from the carrier phycobiliprotein domain by cleavage of 
the linker peptide between the carrier domain and the displayed domain by a specific 
protease. Either embodiment may be practiced in cells (e.g. using a resident protease) or in 
vitro (e.g. using purified fusion proteins, cell-free expression systems, etc.), though cleavage 
is preferably practiced outside the cell to control its timing with respect to folding. 

Accordingly, the invention provides methods and compositions for displaying a 
functional polypeptide domain on an oligomeric phycobiliprotein. The compositions include 
fusion proteins comprising a functional displayed domain and a functional phycobiliprotein 
domain incorporated in a functional oligomeric phycobiliprotein. In particular embodiments, 
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the phycobiliprotein domain is a natural phycobiliprotein domain or modified variant thereof; 
the functional oligomeric phycobiliprotein is an a, p heterodimer; the displayed domain 
comprises an affinity tag, an oligomerizatipn moiety, a specific binding moiety and/or a 
signaling moiety; and/or the displayed domain is refractive to expression in E. coli. The 
compositions also include functional oligomeric phycobiliproteins comprising the subject 
fusion proteins and cells comprising such oligomeric phycobiliproteins. 

The subject methods include methods for making a functional displayed domain 
comprising the step of combining a polypeptide comprising a displayed domain and a 
phycobiliprotein domain with a phycobiliprotein subunit under conditions to form a subject 
fusion protein. In particular embodiments, the methods further comprise prior to the 
combining step, the step of making the polypeptide by expressing a nucleic acid encoding the 
polypeptide; and/or after the combining step, the step of separating the functional displayed 
domain from the functional phycobiliprotein domain. The methods steps may occur 
intracellularly, e.g. in a cell which is or is a progeny of a natural cell which naturally makes 
functional phycobiliprotein, or a cell engineered to produce functional phycobiliprotein. 

DETAILED DESCRIPTION OF PARTICULAR EMBODIMENTS OF THE INVENTION 

The following descriptions of particular embodiments and examples are offered by 
way of illustration and not by way of limitation. 

The invention provides methods and compositions for displaying a functional 
polypeptide (the foreign protein or displayed domain) on an oligomeric phycobiliprotein. 
The compositions include fusion proteins comprising a functional displayed domain and a 
functional phycobiliprotein domain incorporated in a functional oligomeric phycobiliprotein. 

A functional phycobiliprotein domain is capable of assembling the fusion protein in a 
functional oligomeric phycobiliprotein. Preferred domains provide at least 1%, preferably at 
least 10%, more preferably at least 50%, more preferably at least 75%, more preferably at 
least 90% and most preferably substantially equivalent oligomer assembly ability as that of a 
corresponding unfused phycobiliprotein, as measured in competition assays, e.g. as described 
herein. 

Any phycobiliprotein domain having the requisite functionality may be used and these 
may be derived from natural, semisynthetic or synthetic sequences. A wide variety of natural 

3 



0146395A1_I_> 



WO 01/46395 PCT/US00/35028 

phycobiliproteins are known in the art, e.g. Apt and Grossman, 1995, J Mol Biol 248, 79-96, 
including proteins derived from many cyanobacteria, rhodophytes (red algae) and 
cryptomonads, etc. (see, e.g. Glazer, 1994, J Appl Phycol 6, 105-1 12; Glazer et al. 1995, 
Photosynth Res 46, 93-105), particularly phycoerythrins, phycocyanins, and 
allophycocyanins. In addition, a wide variety of methods are known for modifying such 
natural sequences to generate semi-synthetics, e.g. Glazer, 1994, supra, describes 
phycobiliproteins having non-natural, predetermined bilin compositions, and Toole et al., ' 
1998, Mol Micro 30, 475-486 describes recombinations of phycobiliprotein deletion mutants. 
Finally, known phycobiliprotein structure- function relationships (e.g. Anderson et al., 1998, 
Mol Micro 30, 467-474) are exploited to generate synthetic sequence analogs using 
conventional methods. 

In a particular embodiment, the phycobiliprotein domain is characterizable as an a or 
B subunit, based on its sequence similarity to natural a and 15 phycobiliprotein subunits. The 
selection of a or B phycobiliprotein domains may yield different results, affecting the 
displayed domain, the carrier protein, or both, and such differences guide the selection of the 
carrier phycobiliprotein subunit. For example, the phycocyanin a subunit and the 6-L1 1 -a 
subunit-fusion are preferred when the foreign protein is displayed on the N-terminus of the 
carrier protein because of the higher yield and the better spectroscopic properties of the 
resulting fusion protein. For display on the C-terminus, phycocyanin B subunit and the a- 
Ll 1-B subunit fusion are better suited because the phycocyanin a subunit is sensitive to 
extension on its C-terminus (usually leading. to incomplete bilin addition and in some 
instances partial unfolding of the protein). 

In a particular embodiment, the phycobiliprotein domain confers fluorescence on the 
fusion protein, preferably providing fluorescence quantum yield and molar extinction 
coefficients at least 1%, preferably at least 10%, more preferably at least 50%, more 
preferably at least 75%, more preferably at least 90% and most preferably substantially 
equivalent to that of a corresponding unfused phycobiliprotein, measured as described herein. 
Preferred domains provide extinction coefficients of at least 100K, preferably at least 300K, 
more preferably at least 1M and/or quantum yields of at least 0.25, preferably at least 0.5, 
more preferably at least 0.6, as described herein. In other preferred aspects of this 
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embodiment, the fluorescence emission spectrum of the fusion protein is substantially 
equivalent to that of a' corresponding unfused phycobiliprotein. 

In a particular embodiment, the phycobiliprotein domain of the fusion protein 
comprises one or more bilins, preferably functional bilins contributing to the visible 

5 absorption spectrum of the phycobiliprotein domain and fusion protein, preferably natural 
bilins. In a particular embodiment, the bilins are covalently coupled to the phycobiliprotein 
domain, preferably through cysteine thioether linkages, preferably at natural bilin attachment 
sites. The bilins may be coupled to the phycobiliprotein domain by any mechanism that 
provides the requisite functionalities, including enzymatic addition. Accordingly, in a 

10 particular embodiment, the phycobiliprotein domain of the fusion protein provides a substrate 
for enzymatic bilin addition, which may provide natural or non-natural bound bilin 
distribution, preferably a substrate for enzymes which naturally modify a corresponding 
natural phycobiliprotein. 

The displayed domain may be any polypeptide having a requisite functionality and 

1 5 being compatible with that of the phycobiliprotein domain and with assembly of the fusion 
protein in a functional oligomeric phycobiliprotein. Accordingly, a wide range of displayed 
domains may be used including domains comprising an affinity tag, an oligomerization 
moiety, a specific binding moiety and/or a signaling moiety, and including polypeptides 
which may be coupled to phycobiliproteins by chemical conjugation invention (see, inter alia, 

20 US PatNos. 4,520,110; 4,542,104; 4,857,474; 5,055,556; 5,707,804; 5,728,528; and 
5,869,255). 

The displayed domain is preferably of length and sequence sufficient to provide a 
constrained structure, including at least secondary structure, preferably tertiary structure. In a 
particular embodiment, the constrained structure is of complexity sufficient to require 

25 complex folding and is poorly expressed independent of the fusion protein in active form in 
conventional expression systems, particularly conventional yeast (e.g. S. cerevisiae) and 
bacterial (e.g. E. coli) expression systems. Preferred displayed domains are refractive to 
expression when expressed independent of the fusion proteins, i.e. substantially not expressed 
in active form, in conventional expression systems, particularly E. coli. The displayed 

30 domain may have any spectrophotometric properties compatible with the requisite 

functionalities of the fusion protein. In a particular embodiment, the displayed domain is 
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. substantially transparent to the wavelengths of visible light absorbed by phycobiliproteins, 
and therefore does not substantially affect the fusion protein and/or oligomeric 
phycobiliprotein light-harvesting function, and/or is substantially transparent to the 
wavelength(s) of energy emitted by the phycobiliprotein domain, and therefore is not 
5 substantially affected by such energy. 

The displayed domain may frequently be displayed on either terminus of the 
phycobiliprotein domain - important because many proteins preferentially tolerate extension 
on one of their termini. Preferred expression orientation is readily determined empirically, 
and in many cases (e.g. for making phycobiliprotein-labeled fluorescent reagents) is advised 
10 by published C- and N-terminal GFP fusions. In particular embodiments, display on the N- 
terminus of a phycobiliprotein R subunit, rather than the a subunit, is more conducive to 
folding of certain displayed proteins. Use of the subunit-fusion phycocyanins a-Ll 1-B and Ji- 
Ll 1-a as the carrier protein may also have certain advantages: the fusion proteins tend to be 
more stable (see, e.g. Example B, below), and the 1 :1 stoichiometry of a and B subunits is 
15 ensured. 

The fusion proteins may comprise additional components as desired, which may 
provide modules of functionalities, such as affinity handles, dimer- or oligomerization 
domains, stabilization domains, specificity domains, signaling domains, etc., apart from any 
such functionality/ies provided by the displayed domain. For example, for constructs to be 
20 used as fluorescent labels, introduction of GCN oligomerization domains enhances both the 
spectroscopic value (more chromophores) and binding affinity (more sites for intermolecular 
interaction). 

In particular embodiments, the fusion protein comprises a specific binding moiety 
comprising at least one of a specific binding pair, such as a receptor - ligand pair, e.g. an 

25 immunoglobulin antigen-binding domain or antigenic domain, a lectin saccharide-binding 
domain or glycosylated or glycosylatable domain, an avidin or streptavidin biotin-binding 
domain or biotinylated or biotinylatable (i.e. providing a substrate for enzymatic 
biotinylation) domain, etc. In a particular embodiment, the protein comprises a biotinylated 
or biotinylatable domain, which is preferably biotinylated in the expression system (e.g. cell) 

30 selected for expression of the fusion protein. A wide variety of synthetic, semi-synthetic and 
natural such domains are known in the art, see e.g., Schatz et al. 1993, Bio/Technology 11, 

6 
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1 138-1 143; Tatsumi et al., 1996, Anal Biochem 243, 176-180; Samols et al. 1988, J Biol 
Chem 263, 6461-6464, including homologs in phycobiliprotein producing cyanobacteria, e.g. 
Gornicki et al. 1993, J Bacteriol 175, 5268-5272; Phung et al., GenBank Accession No. 
U59235; Nakamura et al. 1998 Nucl Acids Res 26, 63-67. In fact, enzymes sufficient to 
biotinylate biotinylatable domains have been characterized (e.g. Beckett et al. 1999, Protein 
Sci 8, 921-929; Buoncristiani et al. 1988, J Biol Chem 263, 1013-1016), permitting in vitro 
biotinylation (e.g. Li et al., 1992, J Biol Chem 267, 855-863). These biotinylated domains 
permit especially convenient affinity purification tags (e.g. Cronan 1990, J Biol Chem 265, 
10327-10333, Example C, below) and are useful in the many well developed biotin/avidin 
applications (e.g. Wilchek and Bayer (ed) 1990, Methods Enzymol 184, Academic Press, 
NY). 

In another example, various spacers or flexible linker peptides providing a variety of 
functionalities, such as a specific endopeptidase recognition and/or cleavage site, an affinity- 
purification tag, etc., may be used between the displayed and phycobiliprotein domains. For 
example, when displayed C-terminally to the phycobiliprotein domain, a specific protease 
recognition and cleavage site can be engineered immediately upstream from the displayed 
domain so, upon cleavage with the protease, the displayed domain can be cleanly released 
from the carrier protein. This strategy also works for most proteins displayed on the N- 
terminus of the carrier protein because the functions of most displayed proteins are not 
affected by C-terminal extensions several residues long. In situations where such C-terminal 
extension is highly undesirable, an intein domain (Perler FB, Jan 1, 2000, Nucleic Acids Res 
28, 344-345 "InBase, the Intein Database") can be engineered immediately downstream from 
the displayed domain- Subsequent excision of intein cleanly releases the displayed domain 
from the carrier protein. 

The linkers may also be used to facilitate display of domains that would otherwise 
interfere with oligomeric phycobilisome assembly. The length and amino acid sequence 
requirements of such functionality are readily determined empirically for a given fusion 
construct. Generally, the linkers are preferably from at least 5, preferably at least 10 residues 
in length, typically requiring no more than 50, and more often no more than 30 residues. To 
facilitate an unintrusive orientation (see, e.g. Example A, below) small, flexible residues such 
as Ala, Gly and Ser are particularly convenient components. 

- " 7 
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The fusion proteins are incorporated in a functional oligomeric phycobiliprotein, see 
e.g. Glazer, 1994, Adv Mol Cell Biol 10, 1 19-149, comprising at least a dimer, preferably an 
a, B heterodimer. The oligomers, especially when assembled into higher-order structures 
such as trimers, hexamers, rods and phycobilisomes, constrain one displayed protein from 
interacting with another. This is particularly useful in producing proteins whose function is 
(1) harmful to the cell but (2) dependent on the formation of dimer or multimer. In a 
particular embodiment, the oligomeric fusion phycobiliproteins are structurally substantially 
identical to those of corresponding natural phycobiliproteins and preferably provide 
fluorescence quantum yield and molar extinction coefficients at least 1%, preferably at least 
10%, more preferably at least 50%, more preferably at least 75%, more preferably at least 
90% and most preferably substantially equivalent to those of corresponding natural 
phycobiliproteins. In other preferred aspects of this embodiment, the fluorescence emission 
spectrum of the oligomeric fusion phycobiliproteins is substantially equivalent to those of 
corresponding natural phycobiliproteins. - 

Because higher order oligomers (phycobilisomes) can form a complex with the water- 
splitting oxygen-evolving photosystem II, the surrounding environment of phycobilisomes, 
unlike virtually all other cellular environments, may be oxidative enough for spontaneous 
formation of disulfide bonds. Therefore phycobilisome display may be applied to the 
expression and folding of disulfide bond-containing proteins. Accordingly, the compositions 
also include functional oligomeric phycobiliproteins comprising the subject fusion proteins 
and cells comprising such oligomeric phycobiliproteins. 

The fusion proteins are expressible in any convenient system compatible with 
expression of the fusion protein, preferably an empirically, optimized host cell, host cell of the 
most closely related natural phycobiliprotein, or lysate or extract thereof. In a particular 
embodiment/the fusion protein is expressed at least 1%, preferably at least 10%, more 
preferably at least 50%, more preferably at least 75%, more preferably at least 90% and most 
preferably substantially equivalent to that of a corresponding unfused phycobiliprotein 
domain. A wide variety of expression systems may be used. For example, Anabaena cells 
expressing phycobilisome displayed proteins have normal phenotype and growth rates and 
can be grown large scale in defined inorganic salt media. Alternatively, cells or protoplasts 
may be engineered or reconstituted to express requisite lyase subunits (e.g. cpcE and cpcF, 

8 
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e.g. Fairchild et al., 1994, J Biol Chem 267, 16146-16154) and/or phycobiliprotein linker 
proteins (e.g. cpcC, e.g. Swanson et al., 1992, J Bact 174, 2640-2647) under conditions 
wherein the requisite oligomeric phycobilisome proteins are formed. Reconstituted 
heterologous cellular systems require either high levels of phycobiliprotein domain 
expression (sufficient to form dimers; see, Glazer et al., 1973, J. Biol Chem 248, 5679-5685) 
or the expression of a compatible heterophycobiliprotein domain (to form heterodimers), i.e. 
simply attempting to express a labeled phycobiliprotein fusion protein with a coexpressed 
lyase in a hererologous cell is insufficient to form the required oligomeric phycobiliproteins 
(Schroeder 1997, Phycobiliproteins: Biosynthesis and Applications,UC Berkeley, 
Dissertation). Alternatively, extracellular or cell free systems, such as lysates, extracts and 
reconstituted in vitro bilin addition systems may be used (e.g. Arciero et al. 1988, J Biol 
Chem 263; 18343-18349; 18350-18357; 18358-18363). Because phycobiliproteins and 
phycobilisomes are made at very high levels in the cell, especially under low light intensity, 
phycobiliprotein display also helps to increase the yield of the displayed protein. 

The subject methods include methods for making a functional displayed domain, the 
method comprising the step of combining a polypeptide comprising a displayed domain and a 
phycobiliprotein domain with a phycobiliprotein subunit under conditions to form a subject 
fusion protein. In particular embodiments, the methods further comprise prior to the 
combining step, the step of making the polypeptide by expressing a nucleic acid encoding the 
polypeptide; and/or after the combining step, the step of separating the functional displayed 
domain from the functional phycobiliprotein domain. The methods steps may occur 
intracellularly, e.g. in a cell which is or is a progeny of a natural cell which naturally makes 
functional phycobiliprotein. 

Our extensive studies on phycobiliprotein subunit fusion constructs demonstrate the 
versatility of phycobilisome display. In fact, our display concept may be practiced with other 
macromolecules, such as ribosomes, and such macromolecular complex display can also help 
in posttranslational modification of proteins. For example, proteins displayed on 80S 
ribosomes in eukaryotic cytosol can undergo different posttranslational modifications than 
proteins displayed on 70S ribosomes in the mitochondria and chloroplasts. Similarly, proteins 
displayed on different sides of the ER are also subjected to different modifications. 

9 
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EXAMPLES AND DETAILED EXPERIMENTAL PROTOCOLS 
Example A. Expression and characterization of recombinant phycobiliprotein fusion 
proteins. We describe here the expression of genes engineered to encode Anabaena sp. 
PCC7120 C-phycocyanin a and p polypeptides bearing a 24-residue N-terminal peptide tag, 
incorporating a block of six His residues (generally referred to as the "6xHis tag"), in wild- 
type and mutant cells of the filamentous cyanobacterium Anabaena sp. PCC7120. The 6xHis 
tag allows one-step purification of the recombinant polypeptides by immobilized metal-ion 
affinity chromatography [IMAC; (8)] away from the wild-type phycobiliproteins. We show 
here that phycocyanobilin (PCB) is correctly attached to the His-tagged phycocyanin a and P 
polypeptides and that they fold to yield molecules whose spectroscopic properties correspond 
to those of wild-type a and P polypeptides. 

Cultures and Strains -Escherichia coli strain DH5a (11) grown in LB medium was 
used in all cloning and expression experiments. Plasmid-encoded resistance to ampicillin or 
spectinomycin (Sp) was selected at an antibiotic concentration of 100 ^g ml" 1 . For 
overexpression of His-tagged proteins, isopropyl-p-D-thiogalactoside (IPTG) was added to a 
final concentration of 0.5 mM to exponentially growing cultures to derepress the trc 
promoter. Induced cultures were usually grown with vigorous aeration for 5 hr at 30°C to 
reduce formation of inclusion bodies. Induced cells were pelleted and stored at -20°C until 
use. 

Anabaena sp. strain PCC7120 and its derivative strains were grown at 30°C on 
modified AA minimal medium plus nitrate (12). under medium (75 uE nr 2 s" 1 ) to high (200 
uE nr 2 s- 1 ) light intensity provided by cool-white fluorescent bulbs. Anabaena cultures 
overexpressing His-tagged phycocyanin subunits were grown in half-strength AA plus nitrate 
medium (bubbled with 5% C0 2 in air, with 2.5 mM HEPES pH 9.0 buffer to maintain pH) to 
late-exponential phase before addition of IPTG to 0.5 mM. The cultures were induced for 2 to 
3 days before cells were harvested and stored at -20°C. Triparental mating to introduce free- 
replicating plasmids into Anabaena cells from E. coli was performed essentially as described 
(13). Strains bearing the plasmid-borne aadA gene, encoding resistance to streptomycin (Sm) 
and to Sp, were selected at 1 ug ml" 1 Sm plus 10 ug ml" 1 Sp on agar-solidified medium and 
at 20 ug ml" 1 Sp alone in liquid medium. Mutant strains B646, B6432S, and B64407 

10 
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generated by transposon insertion (7) were selected with neomycin sulfate (Nm) at 200 
ml" 1 on solid medium and 10 \ig ml" 1 in liquid medium. 

Cloning o/ Anabaena sp. PCC7120 C-Phycocyanin a and fi subunits - Standard 
procedures were used for most molecular biological manipulations (14). Genomic DNA was 
5 isolated from Anabaena sp. PCC 7120 as described (15). The cpcA and cpcB genes encoding 
the a and p subunits of phycocyanin, respectively, were amplified for genomic DNA by the 
polymerase chain reaction (PCR; 16). The 0.5-kb PCR fragment was digested with the 
restriction enzymes Ndel and HindHL, and cloned into Ndel- and ifz/irflll-digested cloning 
vector pUC19 [(17); GenBank accession No. X02514], giving plasmid pBSl 85. The 0.5-kb 
10 PCR fragment was digested with enzymes Ndel and EcoRI, and cloned into Ndel- and EcoRI- 
digested pUC19, giving plasmid pBS251. All fragments generated by PCR were sequenced to 
verify fidelity using the PRISM 373 DNA sequencing system and the dye-terminator cycle- 
sequencing kit from Applied Biosystems (Foster City, CA). DNA and amino-acid sequence 
analyses were performed with the programs Editbase (Purdue Research Foundation and 
15 USD A/ARS) and Lasergene (DNASTAR Inc., Madison, WI). 

In verifying the sequences of PCR products, two discrepancies were discovered near 
the end of the published cpcB DNA sequence from the same organism (5). Base number 487 
(numbering of the published sequence) is an A instead of a T. This changes the Cys codon 
(TGC) to a Ser codon (AGC). Another difference is within the tandem CTG repeats shortly 
20 before the stop codon. There are four such CTG triplets, rather than only three shown in the : 
published sequence. In addition to sequencing PCR-generated clones, the base changes were 
also confirmed by sequencing of a restriction fragment containing the entire cpcB gene. The 
last 36 base pairs of the cpcB gene sequence determined here, encompassing both corrections, 
encode a 12-amino acid sequence identical to that of another filamentous cyanobacterium, 
25 Mastigocladus laminosus [(18); GenBank accession No. M75599], and the sequence of the 
entire CpcB polypeptide aligns much better with those of other C-phycocyanin p subunits 
(19). The mass of the Anabaena sp. PCC7120 P subunit is in excellent agreement with that 
predicted from the corrected cpcB sequence. A 16.3-kb DNA sequence with the corrected 
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Construction of expression vectors - A family of plasmids was specifically 
constructed for inducible overexpression of His-tagged polypeptides in both E. coli and 
Anabaena sp. PCC7120. Plasmid pBS150v contains the following components: (A) From 0 
to 1.3 kb, a portion from pBR322 [(20); GenBank accession No. V01 119] that contains the 
ColEl oriV for plasmid replication in E. coli, the oriT {bom) site for conjugal transfer, and the 
rop (repressor of primer) gene. Although the first 11 codons of the Rop open reading frame 
were changed in pBS150v, the modified Rop protein apparently can still form the homodimer 
four-alpha-helix structure (21), helping to maintain the plasmid at a medium copy number 
like that of pBR322 (20); (B) From 1.3 to 2.3 kb, most of the C.S4 cassette (22) containing 
the engineered aadA gene that confers resistance to Sm and Sp; (G) From 2.3 to 4 kb, a 
portion from the expression vector pPROEX-1 (Life Technologies, Inc.) that contains the 
lacK gene coding for the lac repressor, and multiple cloning sites for construction of 
sequences coding for His-tagged proteins, expression of which is controlled by the trc 
promoter. The N-terminal 24 residues contain the 6xHis affinity tag as well as a 7-amino acid 
recognition and cleavage site for the tobacco etch virus (TEV) protease for removal of the 
6xHis tag if so desired (23, 24); and (D) From 4 to 4.6 kb, a portion from the expression 
vector pMAL-c2 (New England Biolabs, Inc.) that contains the lacZ a gene for blue/white 
screening of insert recombinants, and downstream, two strong, bidirectional, p-independent 
transcriptional terminator structures from the E. coli rrnB gene (25). The nucleotide sequence 
of pBS150v has been deposited in GenBank under the accession No. AF177932. 

A portion of the plasmid pDUl (26) coding for a replication protein and a resolvase 
was inserted in the unique EcoAlJR site of pBS150v, giving plasmid pBS150. This 3.7-kb 
fragment,. pDUlHC, contains spontaneous mutations that enable autonomous replication of 
cognate plasmids in higher copy number in Anabaena sp. (27). Plasmid pBS152v (sequence 
also available from GenBank under accession No. AF177933) is similar to pBS150v but 
sacrifices the lacZ a gene for more cloning sites like those in pPROEX-1 . The pDUlHC* 
version, pBS152, was constructed like pBS150. His-tagged constructs were usually made 
using the small, "v" versions of the plasmids and then converted to the larger, pDUlHC- 
containing form. When induced, the trc promoter initiates very strong transcription in both E. 
coli and Anabaena. While very tightly controlled in E. coli, the trc promoter is poorly 
repressed in Anabaena sp. partially due to the atypical promoter and the presence of some 
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codons unfavorable to Anabaena sp. in the lacN gene, leading to a relatively high level of 
constitutive expression of the 6xHis-tagged proteins. Preliminary Western analyses showed 
only about three-fold increase in production of His-tagged proteins in Anabaena cultures 
upon induction with IPTG. 
5 Isolation of phycobilisomes - Phycobilisome (PBS) preparations were characterized 

on discontinuous sucrose density gradients as described (28), with minor modifications. 
Frozen or freshly harvested Anabaena cells were used with no obvious difference in results. 
Na/K-P04 buffer (0.75M; 1:1 ratio of Na-P04 and K-PO4, pH 7.5) containing 1 mM each of 
NaN3 and ethylenediamine tetraacetate (EDTA), was used. Individual PBS, rod, and hexamer 
10 fractions collected from the sucrose gradients were dialyzed extensively against buffer 0 (20 
mM Tris-HCl pH 8.0, 50 mM NaCl, 50 mM KC1) to eliminate sucrose, phosphate, azide, and 
EDTA before isolation of His-tagged phycobiliproteins on a Ni 2+ -NTA column (see below). 
Similarly, for isolation of intact His-tagged PBS on Ni 2_f -NTA, the PBS fraction was dialyzed 
against 0.75 M phosphate buffer, pH 7.5 (without EDTA and azide) prior to loading on a 
15 Ni 2+ -NTA column (see below). 

To isolate PBS containing His-tagged phycocyanin subunits, the PBS preparation in 
0.75 M phosphate was loaded on 1 ml of Ni 2+ -NTA resin pre-equilibrated with 0.75 M 
phosphate buffer. The high concentration of phosphate, along with possibly lesser steric 
availability of the 6xHis tags of phycocyanin subunits incorporated within PBS, significantly 
20 decreased the rate of binding of PBS to the Ni 2+ -NTA resin. Several passages of the same . 

sample through the column were needed to get adequate binding. The resin was then washed 
with 20 column volumes of 0.75 M phosphate buffer plus 30 mM imidazole. PBS bound on 
resin were eluted with 0.75 M phosphate buffer plus 200 mM imidazole. The eluate was 
immediately dialyzed against 0.75 M phosphate buffer to remove imidazole. 
25 Isolation of His-tagged Proteins by Immobilized Metal Affinity Chromatography - • 

Cell pellets were thawed and resuspended in 10 to 20 volumes of cold (0-4°C) buffer 0. 
Phenylmethylsulfonyl fluoride (PMSF) and p-mercaptoethanol were added to give final 
concentrations of 1 mM and 10 mM, respectively, immediately before breakage of cells by 
passage through a French press cell, three times at 18,000 p.s.i. Cellular debris was then 
30 removed by centrifugation at 4°C in an angled rotor at 30,000 xg for 20 min (preparations 

from E. coli) or at 130,000 xg for 1 hr (preparations from Anabaena sp.). The supernatant was 
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loaded on a column of 1 to 3 ml of Ni 2+ -NTA resin pre-equilibrated with buffer 0. The resin 
was then washed consecutively with ten column volumes each of cold buffer Al (20 mM 
Tris-HCl pH 8.0, 50 mM NaCl, 50 mM KC1, 20 mM imidazole, 5% v/v glycerol), buffer B 
(20 mM Tris-HCl pH 8.0, 500 mM NaCl, 500 mM KC1), and buffer A2 (20 mM Tris-HCl pH 
8.0, 50 mM NaCl, 50 mM KC1, 30 mM imidazole). His-tagged proteins were eluted from the 
resin with buffer C (20 mM Tris-HCl pH 8.0, 50 mM NaCl, 50 mM KC1, 200 mM 
imidazole). The eluate was immediately dialyzed against 50 mM Tris-HCl pH 8.0, 50 mM 
NaCl, 50 mM KC1, 1 mM DTT, to remove imidazole which at high concentrations is slightly 
denaturing to proteins. Recombinant His-tagged phycocyanin isolated in this manner 
consisted mostly of ap heterodimers. A crude estimate of the yield of His-tagged 
phycocyanin was calculated from the amount of phycobiliprotein (expressed in units of 
A620nm nil) eluted with buffer C vs. the total amount of phycobiliprotein applied to the Ni 2+ - 
NTA resin. 

Size Exclusion Chromatography-High Performance Liquid Chromatography - 
Although IMAC-purified and dialyzed samples (see above) could be used with no obvious 
effect on the results, proteins to be analyzed by SEC-HPLC were generally dialyzed again in 
50 mM Na-PC>4 pH 7.0, 50 mM NaCl, 0.5 mM DTT. SEC-HPLC separations were performed 
with a Waters 600 pump and line detection achieved with a Waters 991 photodiode array 
detector (Waters Associates, Milford, MA). A mobile phase of 50 mM Na-P04 pH 7.0 was 
flowed at 0.8 ml inkr 1 through Bio-Sil 250 (Bio-Rad Laboratories, Hercules, CA) guard 
column (80 x 7.8 mm) and column (300 x 7.8 mm) maintained at 25°C. For analytical 
separations, 50 to 200 jug of phycocyanin was loaded [based on absprbance at the peak near 
621 nm and using a molar extinction coefficient of 290,000 M" 1 cm" 1 per monomer (29)]. 
Molecular size calibration utilized bovine gamma globulin, 158 kDa; chicken ovalbumin, 44 
kDa; horse myoglobin, 17 kDa; vitamin B12, 1.35 kDa (Bio-Rad gel filtration standards). The 
void volume was determined with blue dextran, 2000 kDa (Pharmacia Biotech, Piscataway, 
NJ). Preparative separations were performed with loads of 0.5 to 1 mg phycocyanin. 

Analytical Ultracentrifugation - Measurements were performed on a Beckman XLA 
centrifuge using an An-60Ti rotor maintained at 20°C and 20,000 rpm. Affinity-purified 
samples were run at a concentration of 0.5 mg ml' 1 in dialysis buffer. Data were fit to a two 
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component regression (which was superior to a single-component fit) and the molecular 
weights determined with the HED4000 software. 

SDS-Polyaciylamide Gel Electrophoresis - Proteins were precipitated with 10% (w/v) 
trichloroacetic acid, washed once with ice-cold water, and resolubilized in SDS loading 
5 buffer [2% sodium dodecyl sulfate, 50 mM Tris-HCl, pH 6.8, 100 mM DTT, 0.1% 

Bromphenol Blue, and 10% v/v glycerol; (13)]. SDS-PAGE (30) was performed using 10% 
acrylamide stacking and 14% separating gels, with monomer/bis ratio of 37.5:1. Bilin-bearing 
polypeptides were visualized as fluorescent bands by UV (> 312 nm) illumination after 
staining with 10 mM zinc acetate (31). Polypeptides were also visualized by staining with 
10 . Coomassie Brilliant Blue. Protein molecular weight standards were purchased from Bio-Rad 
Laboratories. 

Absorbance and Fluorescence Spectrometiy - Absorbance spectra were acquired on a 
computer-controlled, dual-beam X6 UV/Vis spectrophotometer (Perkin-Elmer Corp., 
Norwalk, CT). Absorbance of Anabaena cell suspensions was measured from 400 to 750 nm 

15 with a light beam passing through the frosted side of a 1 -cm light path square cuvette. 

Fluorescence measurements were performed, on samples with a maximal absorbance of less 
than 0.07 to minimize the inner-filter effect, with a Perkin-Elmer MPF-44B fluorimeter 
coupled with an I/O board to a Macintosh computer for digitization and storage of data. 
Excitation and emission slits were set at 5 or 6 nm for all measurements. Excitation spectra 

20 were measured with emission observed at a wavelength 10 nm to the red of the sample's 
fluorescence emission maximum. 

Fluorescence quantum yield measurements, for samples isolated from SEC-HPLC, 
were made relative to cresyl violet (in ethanol, Of = 0.59; Eastman Kodak Co., Rochester, 
NY) using the K2 multifrequency phase and modulation fluorimeter (ISS, Champaign, IL) 

25 with excitation and emission slits set at 8 nm. All samples were diluted to peak absorbance 
between 0.01 and 0.05, excited at 570 nm and emission acquired from 575 to 800 nm. These 
emission spectra were then instrument-corrected, converted to wavenumber scale, and 
bandpass corrected by multiplying emission intensity at each wavelength by the square of the 
respective wavelength (32). The resulting spectra were integrated and quantum yields 

30 calculated according to previously described equations (33). 
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Molar extinction coefficients were determined according to previously established 
methods (34). In brief, absorbance spectra of samples isolated as trimers (unless otherwise 
noted) from SEC-HPLC were measured. These samples, having peak absorbance between 0.9 
and 1.2, were diluted lOx with 8 M urea, pH 1.9 containing 10 mM DTT. The absorbance 
was measured at 660 nm for these denatured samples, and molar extinction coefficients were 
calculated for the original native samples from the known value of 35,400 M' 1 cm- 1 (at 660 
nm) for each peptide-bound PCB in acid urea (35). Calculations assumed a stoichiometry of 
one PCB per a subunit and two PCBs per (3 subunit. Molar extinction coefficient of each 
peptide-bound PCB" at 280 nm was measured using trimeric phycocyanins purified from SEC- 
HPLC. Samples in 50 mM Na-PC>4 pH 7.0, with peak absorbance between 1.0 and 1.5 at 620 
nm, were denatured with 9 M urea to give final 8 M urea, pH 2.0. The extinction coefficient 
of a protein-bound PCB at 660 nm was identical to the value mentioned above, while that at 
280 nm was calculated by subtracting contributions from Tyr [S28O nm = 1,370 M' 1 cm" 1 ; 
(36)] and Trp [S 2 80 nm = 5,500 M" 1 cm" 1 ; (37)] residues. 

Mass Spectrometry - For electrospray mass spectrometry, purified proteins (0.5 to 1 
mg) were dialyzed extensively against 10 mM ammonium acetate prior to lyophilization. 
Proteins redissolved in 50% acetonitrile in 0.2% aqueous formic acid were analyzed with a 
VG Bio-Q mass spectrometeras described (38). Mass spectral analysis of different 
phycobiliprotein polypeptides, as well as of some other proteins expressed in Anabaena sp., 
showed quantitative posttranslational removal of the N-terminal Met in Anabaena sp. 
PCC7120. For this reason, amino acid residue numbering starts from the residue after the 
initial Met, and follows that used for numbering of bilin-linked residues in crystallographic 
studies of the highly homologous phycocyanin from Mastigocladus laminosus (39). 
However, our examination of phycobiliproteins from other cyanobacteria, as well as amino 
acid sequence data published by others (40, 41), showed that such posttranslational 
processing is not universal to cyanobacteria. 

Molecular modeling of His-tagged C-phycocyanin - The crystal structure of C- 
phycocyanin from the cyanobacterium Fremyella diplosiphon has been solved at 1.75 A 
resolution (42). The structural data for that phycocyanin, which is highly homologous to that 
of Anabaena sp. PCC7120, were used to build a molecular model of the Anabaena C- 
phycocyanin with the 24-amino acid N-terminal extension on the a subunit. Examination of 
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the C-phycocyanin hexamer structure revealed a groove lying on the p face between p 
subunits. In order to present a reasonable graphical representation of the HTcc:P model, we 
chose to lead the 24-amino acid extension from inside to outside of the ring structure of 
phycocynin trimer through this surface groove. Although structural data supporting this 
choice is lacking, our results suggest that this is a plausible placement. Modeling of the 24- 
residue extension onto the C-phycocyanin hexamer was performed using the published 
monomer coordinates [(42); PDB accession code: Icpc]. The 24-residue tag was built onto 
the N-terminus of the a subunit as follows. First, the coordinates for a well-defined 25- 
residue peptide (the last one at C-terminus being Met) were excised from the published 
coordinates of another protein (antibody 48G7; PDB accession code: 1GAF). Next, all side 
chains except for that of the last Met were truncated to Ala (or left as Gly). Using the least- 
squares superposition of the program LSQMAN (43), the C-terminal Met of the 25-mer 
peptide was overlaid with the N-terminal Met of the a subunit . The tag was then coarsely 
laid into the above-mentioned groove using the torsion commands of the program O (44), and 
the side chains mutated again to conform with sequence of the 24-residue tag of HTa. Again, 
using only the torsion commands, phi, psi, and chi angles on the 24-residue tag were 
manipulated so as to conform to the following constraints: (a) all non-bonded contacts 
involving tag-to-tag and tag-to-phycocyanin atoms must be in excess of 2.5 A; (b) large part 
of the tag must lay more or less in the p-face groove; and (c) the six His residues must place 
externally to the hexamer so as to be solvent-accessible. The model was then subjected to 200 
rounds of Powell energy minimization using the program XPLOR (45, 46), during which all 
of the original C-phycocyanin atoms were constrained from movement. This resulted in the 
torsion angles and non-bonded contacts of the 24-residue tag approaching canonical values. 
Quality of the model with respect to phi/psi angles was verified using the program 
PROCHECK (47). The Ramachandran plot showed that the modeled tag contained no 
disallowed phi or psi angles. To build trimers, symmetry operators were applied to the HTa:P 
monomer model using program O. The hexamer model was built by adding the tag 
coordinates to the second a subunit coordinates of the lcpc structure and then applying the 
symmetry operators. 

Nomenclature - A His-tagged protein preparation purified by IMAC may contain 
more than one protein species. For example, both the apoprotein and holoprotein version of a 
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His-tagged phycocyanin subunit may be present. Consequently, we refer to such a preparation 
by a nomenclature which specifies the organism in which it is being expressed (i.e., 
Anabaena sp. or E, coli\ and the number of the pBS plasmid which contains the gene 
encoding the recombinant subunit. Thus, when His-tagged phycocyanin a subunit is 
5 expressed in Anabaena sp. PCC7120 bearing the plasmid pBS 168, the fraction purified by 
IMAC is designated Anl68. A fully characterized fraction, shown to consist of a single 
species, for example, pure His-tagged phycocyanin a subunit, is designated HTa. A mutant 
polypeptide, for example, a His-tagged Anabaena sp. PCC7120 phycocyanin a subunit with a 
Thr residue in place of an Ala residue at position 12, is designated HTa A12T . The absence or 
10 presence of the bilin chromophores is indicated by the prefixes "apo" and "holo", 

respectively. A His-tagged a subunit with native p subunit non-covalently associated is 
designated HTa: p. 

Table 1 summarizes the properties of each of the Anabaena sp. PCC7120 wild-type 
. and derivative strains and of expression vectors carried by these strains used for the 
15 production of His-tagged C-phycocyanin normal and mutant a and p subunits. 

Expression of His-Tagged C-Phycocyanih a Subunit in Wild-Type Anabaena sp. - The 
0.5-kb Ndel-cpcA-HindUl fragment from pBS185, encoding the wild-type phycocyanin a 
subunit, was cloned into pBS150, giving plasmid pBS168 encoding the a subunit with a 24- 
residue N-terminal extension. Cultures of Anabaena sp. PCC7120(pBS168) expressing 
20 6xHis-tagged phycocyanin a subunit were very similar to wild-type cultures grown under 

identical conditions with respect to the relative amounts of phycobiliproteins and chlorophyll, 
and showed no apparent negative phenotypic characteristics. 

Isolation and characterization of HTa - When cell lysate supernatant from Anabaena 
sp. PCC7120(pBS 1 68) was passed through a Ni 2+ -NTA affinity column, some 30 to 40% of 
25 the cell phycobiliprotein (as estimated from A max at 620 nm) was retained on the column and 
then eluted with imidazole, a His-tag competitor. SDS-PAGE analysis of the His-tagged 
protein fraction showed two bands of 21.9 kDa and 19.5 kDa, corresponding to the calculated 
molecular weights of His-tagged C-phycocyanin a and native p subunits, respectively. Upon 
Zn 2+ -staining, both bands showed red fluorescence with near-UV excitation, indicating that 
30 each carried covalently linked phycocyanobilin. Identification of die two polypeptides was 

confirmed by mass spectrometry, which showed two components of 20,79 1.4±2.3 (holo-HTa 
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minus the N-terminal Met) and 19,441±1.3 (holo~p minus the N-terminal Met) in a 1:1 ratio. 
These mass values confirm the presence of one PCB on HTa and two PCBs on the p subunit. 
No other significant peaks, especially that corresponding to an apo-HTa, were present in the 
mass spectrum. These data indicate that the protein purified by IMAC, Anl68, is 
phycocyanin holo-HTa:holo-p. 

The assembly forms of phycocyanin holo-HTa:holo-p and their apparent molecular 
weights were determined by analytical SEC-HPLC. As shown in Table 2, three components 
were present under our particular chromatographic conditions, a monomer (calculated mass 
40.5 kDa) at 49.2 kDa, a trimer (calculated mass 121.5 kDa) at 145.3 kDa, and a hexamer 
(calculated mass 243 kDa). Over 90% of the protein was trimeric, (HTa:p)3, the major 
assembly state observed under these conditions for native PC isolated from wild-type 
Anabaena sp. PCC7120. The SEC-HPLC data were corroborated by analytical 
ultracentrifugation, where most of the HTa:p was found to have a mass of 125.9 kDa, and the 
balance a mass of 34.8 kDa (Table 2). All three fractions from SEC-HPLC were shown by 
SDS-PAGE to contain holo-HTcc and holo-p in a ratio of 1:1. Thus, the Anl68 His-tagged 
protein preparation purified by IMAC consists of HTa: p, and displays aggregation behavior 
typical of native PC under similar in vitro conditions. 

(HTa:p)3 isolated by SEC-HPLC had a >u max of 621 nm at protein concentrations >0.5 
mg ml' 1 , and gave spectra with blue-shifted maxima (to as low as 615 nm) upon dilution 
(Table 3). This is consistent with the blue shift in X max of wild-type phycocyanin when higher 
order assemblies dissociate to monomers as the protein concentration is lowered (48). 
Interestingly, at high protein concentrations, HTa:p displays a A, max slightly red-shifted 
relative to that of native PC measured at similar concentrations. The s M of (HTa*.p)3 was 
determined to be 9.21xl0 5 M" 1 cm" 1 at 621 nm, similar to values observed for native 
phycocyanin from Anabaena sp. and other cyanobacteria [Table 3; (49)]. The excitation 
spectrum for 650 nm emission coincided nearly perfectly with the absorbance spectrum, 
indicating that the HTa:P preparation behaved as a single species with respect to 
spectroscopic properties. The Awas at 642 nm (with 560 nm excitation) with a 0> F of 0.22, 
similar to a value of 0.27 measured for native Anabaena PC (Table 3). In summary, the 
spectroscopic characteristics of HTa: P are virtually identical to those of native Anabaena sp. 
PCC7120 C-phycocyanin. 
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Incorporation ofHTa into Phycobilisomes - Since holo-HTa :holo-p represents 
between 30 and 40% of the total phycobiliprotein in Anabaena sp. PCC7120(pBS168), it 
appeared likely that holo-HTa is effectively incorporated into phycobilisomes. This 
possibility was examined by analyzing phycobilisomes and their partial dissociation products 

5 for the presence of holo-HTa. Compared to the wild-type Anabaena sp. PCC7120 

phycobilisome preparation, in the Anabaena sp. PCC7120(pBS168) preparation a lower 
percentage of the total phycobiliproteins was found in the phycobilisome fraction, while more 
was found in the rod substructure and the hexamer fractions. This indicates that the 
phycobilisomes of the strain expressing holo-HTa are less stable under the conditions used 

10 for phycobilisome preparation. SDS-PAGE analysis showed that holo-HTa was present in all 
three fractions. 

Anabaena sp. PCC7120(pBS168) phycobilisomes loaded on a Ni 2+ -NTA column (in 
the 0.75M Na/K-PC>4 pH 7.5 buffer needed to maintain phycobilisome integrity, see 
"Materials and Methods"), bound to the column, although the binding capacity of the column 
1 5 under these conditions appeared low. The His-tagged phycobilisomes were eluted at high 

imidazole concentration. After removal of imidazole by dialysis, the SDS-PAGE polypeptide 
profile and spectroscopic properties of this phycobilisome fraction were very similar to that of 
the starting phycobilisome preparation. These results show that the 6xHis tags of the HTa 
subunits incorporated into phycobilisomes are relatively exposed, available for interaction 
20 with the Ni 2+ -NTA matrix. 

Expression of His -Tagged C-Phycocyanin J3 Subunit in Wild-Type Anabaena sp. - The 
0.5-kb Ndel-cpcB-EcoRI fragment from pBS251, encoding the wild-type phycocyanin p 
subunit, was cloned into pBS150, giving plasmid pBS262 -encoding the p subunit with the 24- 
residueN-terminal extension. Cultures of Anabaena sp. PCC7120(pBS262) expressing HTp 
25 were phenotypically very similar to those of Anabaena sp. PCC7120(pBS168) expressing 

HTa, except for a slightly yellowish appearance. Their whole-cell absorbance spectra showed 
a small decrease in the amount of phycobiliprotein per cell relative to chlorophyll. 

The yield of the His-tagged phycobiliprotein fraction from Anabaena sp. 
PCC7120(pBS262) was 16-18% of the total phycobiliprotein in the cell lysate, substantially 
30 lower than the 30-40% in the corresponding fraction from Anabaena sp. PCC7120(pBS 1 68) 
expressing HTa (see above). Analysis of the His-tagged phycobiliprotein preparation, An262, 
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by SDS-PAGE showed the presence of holo-HTj} and holo-a in equal amounts. Mass spectral 
analysis showed that the two subunits carried the normal amounts of PCB, i.e., one PCB per 
a and two per HTP subunit, and no apo-polypeptides were present. 

In analytical SEC-HPLC, the a:HTp holoprotein preparation isolated by IMAC was 
5 primarily trimeric (calculated mass 121 .5 kDa) at 136.9 kDa. The shoulder/tail of that peak, 
constituting less than 5% of the total His-tagged protein loaded, could be attributed to 
monomers (calculated mass 40.5 kDa) at 62.4 kDa (Table 2). Both fractions were shown by 
SDS-PAGE to contain holo-a and holo-HTp in a molar ratio of 1 : L In contrast to native PC 
and the HTcc:p holoprotein, the hexamer component was not observed for the cc:HTp 
10 holoprotein. Analytical ultracentrifugation data were consistent with two components 

observed in SEC-HPLC: a majority of the ot:HTP holoprotein had a mass of 121 .9 kDa, and a 
small fraction of 43.4 kDa, corresponding closely to the theoretical masses of trimers and 
monomers, respectively. 

The of the trimeric a:HTp holoprotein component lay between 615 and 619 nm 
15 depending on protein concentration (Table 3). As observed with HTa:P, 7^ Tmx shifted blue 

upon dilution. However, at high protein concentration the A™a X of the a:HTp holoprotein, like 
that of native PC, did not exceed 619 nm, in contrast to the of 621 nm observed for 
HTa:p under similar conditions. The 8m of a:HTp trimers was found to be 8.88xl0 5 M" 1 cm* 
1 at 619 nm (Table 2), a slightly lower value than that observed for HToc-.p. The excitation 
20 spectrum for 650 nm emission corresponded nearly perfectly to the absorbance spectrum, 
indicating that the a:HTp preparation behaved as a single species with respect to 
spectroscopic properties. The A,was at 642 nm (with 560 nm excitation), with a 3> F of 0.23 
(Table 2). Thus the spectroscopic characteristics of a:HTP are virtually identical to those of . 
native Anabaena sp. PCC7120 C-phycocyanin and its HTa:P counterpart. 
25 A sucrose density gradient preparation of phycobilisomes from Anabaena sp. 

PCC7120(pBS262) expressing HTP showed distribution of phycobiliprotein aggregates 
different from that of the wild-type cells. A much smaller proportion of the phycobiliprotein 
(25%) sedimented in the phycobilisome fraction, with 40% and 35% in the rod and hexamer 
fractions, respectively. SDS-PAGE analysis showed that HTP was present in all three 
30 fractions. However, less HTP (relative. to the other phycobiliproteins) was found in 

phycobilisomes compared to the amount found in the products of phycobilisome dissociation, 
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rods and hexamers, suggesting a reduced stability of phycobilisomes that have incorporated 
more HTp subunits. As described above for Anabaena sp. PCC7120(pBS168) expressing 
HTa, His-tagged phycobilisomes could be isolated from the Anabaena sp. 
PCC7120(pBS262) phycobilisome fraction by IMAC, although in a relatively poor yield. The 
5 low yield might be due to the smaller number of HTp subunits present per phycobilisome, a 
reflection of the smaller amount of holo-HTP relative to holo-p produced per cell. 

Highly purified (HTa:P>3 and (a:HTp)3 phycocyanins were used to measure the 
extinction coefficient of peptide-bound PCBs in 8 M urea pH 2.0. Two different preparations 
were measured for each protein, and four measurements gave very similar values. The molar 
10 extinction coefficient for one peptide-bound PCB at 660 nm is very close to the value of 
35,400 M 1 cm" 1 (35). Upon subtraction of contributions from Tip and Tyr residues, the 
extinction coefficient for one peptide-bound PCB at 280 was determined to be 
(14.85±0.1)xl0 3 M" 1 cm" 1 (Table 3). 

Expression of HTa and HT/3 in a cpcB AC backgi-ound - The separate overexpression 
15 of HTa and HTp in phycocyanin-minus backgrounds was examined in mutant strain B646 
(7). This mutant has a transposon inserted between the promoter and the CpcB open reading 
frame, which is expected to eliminate transcription of cpcB AC, genes encoding the p and a 
subunits of phycocyanin and a rod linker polypeptide, respectively (5). Strains B646(pBS168) 
and B646(pBS262) (see Table 1), generated by introduction of pBS168 and pBS262 into 
20 strain B646 by conjugation, were phenotypically indistinguishable from the parent mutant 
. strain. Both phycocyanin-deficient strains grow slowly and produce very low levels of the 
His-tagged polypeptides. 

SDS-PAGE analysis of the His-tagged phycobiliprotein fractions from strain 
B646(pBS262) showed no a or P subunits of wild-type phycocyanin. However, multiple 
25 attempts at purifying the HTa from strain B646(pBS168) yielded fractions containing some 
native P subunit, presumably resulting from a basal leaky transcription of the cpcBAC operon. 

On SEC-HPLC fractionation, the IMAC-purified proteins from B646(pBS262), 
An262-BAC, eluted as a single peak of subunit homodimers, i.e., (HTph- Purified native p 
subunits of C-phycocyanin dimerize at concentrations of 1-10 juM (34, 50). Since mass 
30 spectrometry showed that the HTp subunits produced in this cpcBAC background carried a 
full complement of PCB, it is evident that bilin addition to phycocyanin p subunit is 

22 



WO 01/46395 



PCT/IJS00/35028 



independent of the presence of the phycocyanin apo- or holo-ot subunit. The mass spectral 
analysis also indicates the presence in HTp of the posttranslational methylation at the y-amino 
group of the Asn 72 residue normally found in phycocyanin p subunits (51, 52). 

Under similar SEC-HPLC conditions IMAC-purified protein preparation from 
B646(pBSl68), Anl68-BAC, fractionated into three species: trimer, monomer, and subunit. 
SDS-PAGE analysis of the fractions indicated that the trimer and monomer fractions 
contained HTa and holo-p in equimolar amounts. The subunit fraction contained only HTa, 
the majority of which appeared to be apo-HTa based on the relative intensities in the 
absorption peaks at 280 nm and 618 nm, and from comparison of Zn 2+ - and.Coomassie- 
staining intensities of the Anl68-BAC trimer and monomer fractions (HTa:P) run in parallel 
on the same gel. 

Analyses of the His-tagged protein preparations from strains B646(pBS168) and 
B646(pBS262) indicate that overexpression of HTp in the cpcBAC background yields a 
soluble protein where the HTp subunit is stabilized by homodimer formation. The results also 
indicate that HTa appears to be unable to form subunit homodimers in the cell, and may 
require the presence of P subunit that is extremely limited in the cpcBAC background for 
stabilization. The greatly reduced chromophorylation of HTa protein isolated from this 
cpcBAC strain may also reflect a possible requirement of subunit dimerization (a.*p, a:a, or 
p:p) prior to covalent attachment of PCB, or misfolding of a significant fraction of apo-HTa 
in the absence of apo- or holo-p. 

The HTa and HTp subunits isc>lated from mutant strains B646(pBSl 68) and 
B646(pBS262) have absorbance spectra and molar extinction coefficients (Table 3) similar to 
those reported for renatured a and P subunits of native C-phycocyanin, obtained by 
chromatographic separation under denaturing conditions (34, 50). Algebraic addition of the 
HTa and HTp spectra yielded a spectrum with shape similar to that of monomelic PC but 
with a blue-shifted maximum at 611 nm, consistent with previous observations (34). 
Incubation of the HTa and HTp subunits together overnight at 4°C, followed by fractionation 
by SEC-HPLC, yielded a trimer fraction with absorbance and fluorescence excitation and 
emission spectra similar to the corresponding spectra of HTa:p 5 a:HTp,.and native PC. This 
indicates that holo-HTa (but not apo-HTa, see below) interacts preferentially with holo-HTp 
to form trimers, and His-tagged holophycocyanin subunits isolated in the absence of their 
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cognate subunit partners can be reconstituted to yield trimers with properties similar to those 
of native phycocyanin. 

Expression of His-tagged phycocyanin subunits in an apophycocyanin a subunit PCB 
lyase-deficient background - A dimeric lyase, CpcEF encoded by genes cpcE and cpcF, 
5 specifically catalyzes the addition of PCB to Cys 84 of the phycocyanin a subunit (1,3). We 
have obtained two mutants through transposon mutagenesis, B64328 with an insertion in the 
cpcE gene, and B64407 in the cpcF gene. Both mutants had greatly reduced cellular content 
of normal phycocyanin (7). Expression of His-tagged phycocyanin subunits in the mutant 
strains provided a simple way of assessing how mutations in each of the two CpcEF lyase 
1 0 subunits affect PCB addition to the phycocyanin a subunit, and how cognate PC proteins 
behave in the cell. 

Expression ofHTa in either a cpcE or a cpcF backgi-ound - To express the HTa 
polypeptide in either a cpcE or cpcF background, plasmid pBS168 was introduced into both 
mutants, giving strains B64328(pBS168) and B64407(pBS168) (see Table 1). The two strains 

1 5 showed the yellowish-green phenotype characteristic of their respective parent mutants. The 
His-tagged phycobiliprotein fraction from either mutant strain amounted to < 5% of the total 
absorbance of the cell lysate at 607 nm. The His-tagged protein purified from the cpcF strain 
B64407(pBS168), Anl6S-F, was shown by SDS-PAGE and mass spectral analyses to be apo- 
HTcc:holo-p, with no detectable holo-HTa. In contrast, the Anl68-E protein contained apo- 

20 HTa:holo-p along with a small amount of holo-HTarholo-fi. On SEC-HPLC, apo-HTa:holo- 
p is seen to form monomers preferentially, in contrast to the preferred trimer formation by 
holo-HTa:holo-p. 

Sucrose density gradient sedimentation patterns of phycobilisome preparations from 
strains B64328(pBS168) and B64407(pBS168) were very similar to those given by the 

25 parental strains B64328 and B64407 (7). In both cases, the phycobilisome fraction consisted 
largely of the allophycocyanin-containing phycobilisome cores carrying much reduced 
amounts of rod components (relative to phycobilisomes from wild-type cells). SDS-PAGE 
analysis showed the presence of apo-HTa in the phycobilisome and rod/hexamer fractions 
and an amount of phycoerythrocyanin, a distal component of Anabaena phycobilisome rod 

30 substructures, greater than that seen in wild-type phycobilisomes (7, 53). The relative amount 
of apo-HTa was higher in the phycobilisome fraction than in the rod/hexamer fraction, 
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suggesting that apo-HTa assembled into phycobilisomes might be turned over more slowly in 
the cell. 

The finding that apo-HTa:holo-(3 assembles into phycobilisomes is consistent with 
earlier observations of such incorporation of the phycocyanin apo-oc subunit in cpcE and cpcF 
5 mutants of the unicellular cyanobacterium Synechococcus sp. PCC7002 (3, 4). We showed 

previously that the Anabaena sp. PCC7120 mutant strains B64328 (cpcE) and B64407 (cpcF) 
incorporated the phycocyanin apo-a:holo-P into phycobilisomes, and that this "defective" 
phycocyanin still allowed the assembly into phycobilisomes of the distal phycoerythrocyanin 
component (7). The experiments reported here extend that observation to phycocyanin apo- 
10 HTa:holo-p proteins. 

Expression of HT(3 in either a cpcE or a cpcF background - To express the HT(3 
polypeptide in either a cpcE or cpcF background, plasmid pBS262 was introduced into both 
mutants, giving strains B64328(pBS262) and B64407(pBS262) (see Table 1). Yield of His- 
tagged holophycobiliproteins from either mutant strain was low, accounting for < 7% of the 
15 absorbance of the cell lysate at 607 nm. In contrast to results obtained with the mutant strains 
expressing HTa (see above), when HT(5 was expressed, SDS-PAGE analysis showed that the 
His-tagged protein preparations contained apo-a and holo-HT(3 in a molar ratio of 
approximately 1:2. SEC-HPLC fractionation showed presence of apo-a:holo-HT(5 trimers 
and monomers, and of (holo-HTp)2 homodimers. The apo-a:holo-HT(3 trimers had 
20 spectroscopic properties virtually identical to those of apo-HTa :holo- (3 trimers described 
above. Analyses of phycobilisomes and of rod/hexamer fractions showed that some of the 
holo-HTp (likely in the form of apo-a:holo-HTp) was incorporated into both fractions. 

Expression of His-tagged mutant phycocyanin subunits - The favorable expression 
and incorporation of His-tagged wild-type phycocyanin subunits in Anabaena sp. provides a 
25 convenient means to perform in vivo and in vitro studies of mutant phycobiliprotein subunits. 
Here we present two examples. 

Expression ofHTa A12T - One of the cpcA clones generated by PCR using the Tag 
DNA polymerase was found to have a single G -> A base change leading to an Ala 12 -> Thr 
mutation in the translated protein. The Ala 12 residue is conserved in all sequenced 
30 phycocyanin a subunits (19). The crystal structure of phycocyanin predicts that replacement 
of Ala 12 by a residue with a larger side chain would interfere with ap heterodimer formation 
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by steric hindrance of the interaction of a-Asp 13 with (3-Tyr 97 (54). We expressed the 
HTa AI2T mutant subunit in wild-type Anabaena sp. PCC7120 to examine this prediction. 

The mutant cpcA gene was cloned into pBS150, giving plasmid pBS167. Anabaena . 
sp. PCC7120(pBS167) expressing HTa Ai2T was phenotypically very similar to strain 

5 Anabaena sp. PCC7120(pBS168) (see Table 1). However, the His-tagged protein fraction 
was obtained in very poor yield from Anabaena sp. PCC7120(pBS167): < 0.15% of the 
phycobiliproteins in the cell lysate as estimated from A620 ran- While spectroscopically 
similar to the Anl68 His-tagged protein preparation, mass spectral analysis of the Anl67 His- 
tagged protein preparation showed two components in similar amounts, with masses 

10 corresponding to those of holo-cc and holo-HTcc A52T . These results lead to two conclusions. 
First, the mutant HTa Al2T subunit appears to have a greatly reduced affinity for the holo-p 
subunit. Most of the HTot A12T is likely to be present as free subunits in the cell and to be 
degraded rapidly. HToc A12T also seems to have a low affinity for holo-a, and is consequently 
stabilized in vivo by the formation of holo-a :holo-HTa A12T dimers. Second, the in vivo 

15 covalent attachment of PCB to the phycocyanin a subunit evidently does not require that it 
form a complex with the p subunit. 

Sucrose density gradient preparation of phycobilisomes from strain Anabaena sp. 
PCC7120(pBS167) gave a banding pattern similar to that from the wild type. The HTa A12T 
subunit was not detected in phycobilisome and rods fractions, suggesting that HTa A12T was 

20 not assembled into the phycobilisomes (although the very low amount of HToc A12T may have 
evaded detection),. 

Expression ofHT{fi 46G > N76D - One of the cpcB fragments, PCR-amplified using the 
Tag DNA polymerase, contained two mutations: both an A->G change, in bases number 139 
and 229, respectively, of the published sequence (5). The changes resulted in the replacement 

25 of residue Ser 46 with Gly, and of residue Asn 76 with Asp in the CpcB protein. Anabaena sp. 
PCC7120(pBS162) expressing the HTp S46G > N76D mutant PC subunit was phenotypically 
identical to the one expressing His-tagged wild-type p subunit. The HTP S46G > N76D mutant 
protein behaved the same way as the His-tagged wild-type counterpart in all aspects tested 
(yield, assembly states in vitro, assembly into phycobilisome in vivo, etc.) except that the 

30 purified protein, Anl62, had an absorption maximum at 610 nm, about 8-nm blue- shifted 

26 



WO 01/46395 PCT/US00/35028 

compared to that of the An262 protein. Fluorescence emission maximum of the Anl62 
protein, however, was still at 642 nm. 

Ser 46 of the phycocyanin P subunit, situated in the coil region connecting helices A 
and B (39, 54), is not in contact with any chromophore. Its non-conserved substitutions 
5 include Gly in some phycobiliprotein subunits. It therefore may be assumed that the Ser 46 -> 
Gly change would not have obvious effects on the protein. The Asn 76 residue, on the other 
* hand, is highly conserved among multi-chromophore p subunits of phycobiliproteins (19), 
" and its sidechain is in contact with ring D of the a-84 PCB chromophore of PC trimers, 
possibly involved in maintaining its stretched confirmation in trimer aggregates (55). The 
1 0 neighboring Arg 77 residue, also highly conserved, forms a hydrogen bond with the propionic 
acid substituent of ring B of the p-84 PCB (54, 55). In the mutant protein, replacement of 
Asn 76 by Asp 76 sidechain may interfere with that hydrogen bonding, thereby affecting 
absorbance of P-84 PCB. A reduction of absorbance of P-84 PCB would shift the X max of the 
cognate phycocyanin to the blue (50). 
15 Molecular modeling of His-tagged C-phycocyanin - Based on crystallographic data 

obtained from the cyanobacterium Fremyella diplosiphon C-phycocyanin (42) that is highly 
- homologous to that of Anabaena sp. PCC7120, a molecular model was built for the 
Anabaena sp. PCC7120 C-phycocyanin incorporating the 24-residue N-terminal extension. In 
this particular model, the N-terminal extension can be led from inside to outside of the ring 
20 structure of the phycocyanin trimer, through a groove between P subunits on the P side of the 
trimer. This allows unhindered a face-to-a face stacking of two trimers to form a hexamer. 
The 24-residue N-terminal tag, occupying only the groove space in the p faces, also does not 
appear to interfere with stacking of hexamers on p faces to form rods of the phycobilisome. 
The first N-terminal 13 of the 24 residues of the tag, that include the six His residues, are 
25 completely exposed outside of the rod surface, consistent with the observed affinity 
purification of phycobilisomes by IMAC (as described above). 

In summary, methodology developed in this study allows (a) high level expression of 
holo-HTa and holo-HTP in Anabaena sp. PCC7120 and a variety of its mutant derivatives; 
(b) facile affinity purification of the His-tagged polypeptides and of complexes into which 
30 they are incorporated, including intact phycobilisomes; (c) study of in vivo assembly and bilin 
addition; and (d) analysis of site-specific mutants of C-phycocyanin. It is evident that the 
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approaches described here are immediately generalizable to other Anabaena 
phycobiliproteins, such as allophycocyanin and phycoerythrocyanin (3, 7, 29). 

With appropriate choice of organism, these approaches can be applied to the 
expression and study of other phycobiliproteins including phycoerythrins. Finally, the studies 
described here provide for the design and expression of diverse fusion proteins in Anabaena 
sp. with important outcomes for the successful production of modular fluorescent 
phycobiliprotein tags and for the exploration of heterologous protein folding in cyanobacteria 
(9, 10). 

Example B. Recombinant phycobiliprotein fusion proteins with oligomerization and 
biospecific recognition domains. Here we describe the design and expression of more 
complex recombinant phycobiliprotein constructs which incorporate oligomerization and 
biospecific recognition domains and in some of which the a and p subunits are covalently 
bridged. Materials and methods essentially as described in Example A are not-restated. 

Assays of protein binding to streptavidin - Western hybridization was carried out 
essentially as described (14). Proteins separated on SDS-PAGE were transferred to 
polyvinylidene difluoride (PVDF) membrane (Immobilon; Millipore Corp., Bedford, MA) 
using a mini Trans-Blot cell (BioRad Laboratories). Electrophoretic blotting was carried out 
in ice-cooled transfer buffer for 1 hr at 100 V. The PVDF membrane was then blocked with 
3% bovine serum albumin in PBS buffer (1 15 mM NaCl, 4 mM KH2PO4, 16 mM Na 2 HP04, 
pH 7.4) plus 0.5% Tween-20 (Sigma Chemical Co.). Binding of membrane-bound proteins to 
streptavidin was assayed by incubating the membrane at 25 °C for 1 hr in PBS buffer plus 
0.1% Tween-20 and 1 :5000-diluted streptavidin-alkaline phosphatase conjugate (Prozyme 
Inc., San Leandro, CA). Alkaline phosphatase-mediated color development on the membrane 
using chromogenic substrates 5-bromo-4-chloro-3-indolyl phosphate (BCIP) and nitro blue 
tetrazolium (NBT) was carried out as described (14) except that 200 mM Tris-HCl pH 8.8 
was used as the developing buffer. 

Alternatively, streptavidin-coated agarose beads were used to test binding of Strep2- 
tagged phycocyanin. As has been described (56), beads coated with recombinant core 
streptavidin (Biometra Inc., Tampa, FL) gave better results than those coated with natural 
streptavidin (Sigma Chemical Co. or Prozyme Inc.). Streptavidin-coated beads were 
incubated at 4 °C for about 20 min with excess amount of purified Strep2 -tagged 
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phycocyanin, washed twice with buffer W (100 mM Tris-HCl pH 8.0, 1 mM EDTA), and 
visualized by fluorescence. Phycocyanin-labeled beads were observed either in batch with 
ultraviolet (>312 nm) excitation of fluorescence viewed through a 550 nm long-pass filter, or 
under an epifluorescence microscope (the BH-2 system from Olympus America Inc., Lake 

5 Success, NY) with excitation light through a 450-480 nm band-pass filter and emission 
observed through a 5 1 5 nm long-pass filter. 

Constiiiction of modular cloning vectors for protein expression - A series of cloning 
and expression vectors was designed and constructed for expression of fusion proteins with 
different functional domains and tags on the N-terminus. To facilitate shuffling of the 

10 functional domains to produce desired combinations, DNA cassettes coding for these 

domains were designed as exchangeable modules. Care was taken in the design to ensure that 
only optimal codons (in respect to E. coli and Anabaena sp. PCC7120) were used, and that no 
restriction sites (for 6-bp cutters) were present in the core sequences, except for designed-in 
signature sites. 

1 5 The expression vector pBS 1 50v [Example A; GenBank Accession number 

AF177932] was used as a template in the engineering of functional domains. Generally, a pair 
of designed oligonucleotide primers was used to run inverse PCR, producing a new plasmid 
in which the sequence from bp 3892 to 3938 (encoding the 6xHis affinity tag and a spacer) of 
pBS150v is replaced by a desired functional module. 

20 Replacement with the 56-bp NcoI-BspMR Strep2 module generated plasmid pBS283v 

(4,644 bp; Table 4). The 10-residue Strep2 tag is able to bind specifically to streptavidin (57). 

Replacement with the 146-bp Ncol-Agel combination module (6xHis plus GCN4-pII) 
gave plasmid pBS3 1 1 v (4,734 bp; Table 4), and with the 1 64-bp combination module (Strep2 
plus GCN4-pLI) gave pBS303v (4,752 bp; Table 4). Sequences for the GCN4-pII and GCN4- 

25 . pLI coiled coil domains were modified from part of the original Saccharomyces cerevisiae 
GCN4 sequence encoding the leucine zipper domain (58, 59), to conform with the pll 
sequence (positions a and d are both He) or the pLI sequence (positions a and d are Leu and 
lie, respectively) (60). A BglR site was engineered into the GCN4-pII coding sequence as a 
signature site, as was an AsuH site in GCN4-pLI. 
30 With plasmids pBS283, pBS303, and pBS3 1 1 in hand, four functional domains, 

6xHis, Strep2, GCN4-pII, and GCN4-pLI, are available for further manipulation. Since the 
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different functional domains are modular in design, different cloning and expression plasmids 
with various combinations of the modules were easily made by simple recombination cloning 
(Table 4). All resulting plasmids have the modular region end with the second half of either 
the BspMU or the Agel site, encoding a Gly, immediately upstream from the recognition and 
cleavage sequence for the tobacco etch virus (TEV) endoprotease (Example A, herein, 23 , 
61). The 7-residue TEV site was retained in all plasmids to act as a spacer between the 
modular functional domains and the multiple cloning sites into which genes of interest are 
inserted, and to provide a means of removing the modular functional domains from expressed 
fusion proteins if so desired. The LacZ a domain downstream from the TEV site was found to 
be functional in all cognate plasmids, allowing blue/white screening of inserts in the 
multicloning region. The multicloning region of these plasmids can be greatly enhanced in 
lieu of the LacZ a function by recombination with plasmid pBS152v [(Example A, herein); 
GenBank Accession number AF177933]. For plasmids encoding phycocyanin subunits, the 
Ndel-cpcA-Hindlll and Ndel-cpcB-EcoKl fragments were taken from plasmids pBS185 and 
pBS251, respectively (Example A, herein). 

Construction of genes encoding phycocyanin subunit-fusion monomers - X-ray 
crystallographic data obtained from the highly homologous C-phycocyanin from another 
cyanobacterium Fremyella diplosiphon (42) were used to design subunit fusions of the 
Anabaena phycocyanin, see Example A. Straight-line distance between a carbons of the last 
residue of a subunit and the first residue of P subunit was measured to be 29.5 A. The 
distance between the last residue of p subunit and the first residue of a subunit was 32.6 A. 
Although the average sum of three chemical bonds' length of a peptide unit is 4.3 A, the 
actual straight-line distances between two a carbons were found to range from 3.2 to 3.7 A 
among several peptide units measured in the crystal structure. An 1 1 -residue linker was 
therefore chosen to link the C-terminus of one subunit to the N-terminus of another. The 11- 
residue linker is sufficiently long to.bridge the distance (29.5 to 32.6 A) between the N- and 
C-termini of the two subunits, and provides some leeway for bending should this be required 
for higher-order assembly of the resultant fusion phycocyanins. 

One obvious difference between phycocyanins of Fremyella diplosiphon and of 
Anabaena sp. PCC7120 is the lack of the initial Met residue in the Anabaena subunits as a 
result of posttranslational modification (Example A, herein). To reduce possible steric 
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interference in the engineered Anabaena protein, an Ala was inserted in place of the missing 
Met at the N-terminus of the subunit following the fusion linker. Since both a and P subunits 
have an a-helical C-terminal region made up mostly of small hydrophobic residues, Ala was 
chosen as the first residue in the fusion linker to maintain the local hydrophobic environment. 
5 The rest of the linker peptide (LI 1) was designed for high structural flexibility, and consists 
mostly of Gly residues and two Ser. Incorporation of the signature site Xmril in the LI 1 
coding sequence not only facilitates characterization of PGR products, but also allows easy 
construction of fusions of other proteins to the C-terminus of phycocyanin a or P subunits 
(Example C, herein). 

10 Designed oligonucleotide primers incorporating the LI 1 linker sequence were used 

for inverse PCR with template DNA consisting of both pBS l 85 and pBS25 1 plasmids 
(Example A, herein), giving plasmids pBS310 (Ndel-cpcA-Ll \-cpcB-EcoRI in pUC19) and 
pBS307 (Ndel-cpcB-Lli-cpcA-HindOX). Once sequenced to confirm fidelity, the 1047-bp 
Ndel-EcdRI fragment of pBS310 was cloned into Ndel- and EcoRI-digested pBS150 

15 (Example A, herein), giving plasmid pBS320 which encodes 6xHis~tagged CpcA-Lll-CpcB 
fusion phycocyanin. In like manner pBS3 15 was made to encode the 6xHis-tagged CpcB- 
Ll 1-CpcA fusion protein (Table 4). A PCR-generated spontaneous mutant product, in which 
6 bases in the signature Xmnl site of the LI 1 linker sequence had been deleted (giving the L9 
linker), was also used in this study (pBS319; see Table 4). 

20 . ' All other molecular biological manipulations followed standard procedures (14). To 

ensure sequence fidelity, all DNA fragments that had gone through the PCR process (16) 
were sequenced using the PRISM 373 DNA sequencing system and the dye-terminator cycle- 
sequencing kit from Applied Biosystems (Foster City, CA). Analyses of DNA and amino- 
acid sequence data were performed with the programs Editbase (Purdue Research Foundation 

25 and USDA/ARS) and Lasergene (DNASTAR Inc., Madison, WI). 

Nomenclature - A recombinant polypeptide encoded by an expression plasmid is 
referred to. by the plasmid number, and the His-tagged protein fraction isolated by affinity 
chromatography is designated by the organism in which the recombinant polypeptide is 
expressed. For example, a phycocyanin a subunit with both 6xHis and Strep2 tags encoded 

30 by plasmid pBS327, expressed in E. coli and purified by IMAC is designated Ec327. The 
purified, characterized protein is then designated as HT-Strep2-a. The same construct 
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expressed in Anabaena sp. PCC7120 is designated An327. However, the purified protein is 
HT-Strep2-a:p, because in Anabaena sp. the HT-Strep2-a forms a stable complex with the 
holophycocyanin (3 subunit. Phycobiliproteins expressed in E. coli are apoproteins (lacking 
attached phycobilins), whereas those expressed in Anabaena sp. are holoproteins, unless 
5 otherwise noted. 

Design of subunit-fusion phycocyanins - A His-tag at the N-terminus of either an a or 
a p subunit does not interfere with the folding of these polypeptides into their native 
structures (Example A, herein). Two possible a/|3 fusions can be generated with an 11 -residue 
(L-l 1) linker: HTa-Ll 1-p and HTp-Ll 1-a. In either construct, one subunit, the a or the b, 

10 would have a 24-amino acid extension (including the 6xHis tag) at the N-terminus and a large 
fusion, linked through LI 1, at its C-terminus. Both constructs were made to determine the 
folding and spectroscopic properties of such fusions. 

Expression ofHis-tagged phycocyanin fi-a fusion protein in E. coli - HTb-Ll 1-cc is 
expressed in E. coli as the apoprotein (lacking the three phycocyanobilins). At 37 °C, in 

15 cultures induced with IPTG for 5 hrs, the fusion protein (Ec315) represented over 5% of total 
cellular proteins (>7.5 mg per liter of culture at 10 9 cells ml" 1 ), and at 30 °C, in cultures 
induced with IPTG for 12 hr, some 20% of total cellular proteins. HTp-Ll 1-a purified by 
IMAC could be concentrated up to 0.2 mM protein (nearly 8 mg ml" 1 ) in 50 raM Tris-HCl pH 
8.0, 50 iiiM NaCl, 50 mM KC1, 1 mM DTT, and 5% glycerol. The yield and solubility were 

20 markedly higher than that obtained upon expression of the individual His-tagged phycocyanin 
subunits, HTct (Ecl68) and HTp (Ec262), in E.coli (Example A, herein). With IPTG 
induction at 30 °C, in the latter cases, yields were about 0.1 mg per liter of culture. Upon 
IPTG induction at 37 °C, -60% of the individually expressed 6xHis-tagged subunits 
remained in solution while the balance formed inclusion bodies. Thus coexpression of the 

25 apo-a and apo-p subunits, inherent in the expression of HTP-L1 1-a, evidently promotes 
folding and concomitant retention of solubility of the recombinant apoprotein. 

Expression of His-tagged phycocyanin fi-a fusion pi'otein in Anabaena sp. - Cultures 
of Anabaena sp. PCC7120(pBS315) expressing HTp-Ll 1-a appeared more blue than that of 
the wild type. Whole-cell absorbance spectra showed a much higher ratio of 

30 phycocyammchlorophyll a. Despite appearing very healthy, Anabaena sp. 

PCC7120(pBS315) cultures grew about 30% more slowly than the wild type. 
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When cell lysate supernatant from Anabaena sp. PCC7120(pBS315) was passed 
through a Ni 2+ -NTA affinity column, usually > 33% of the total phycobiliproteins (as 
estimated from A 62 onm) was retained on the column and then eluted with imidazole (resultant 
protein denoted An3 15). Such high yields are comparable to that of the strain expressing HTa 
5 (> 30% of A 62 onm)> and better than that of the. strain expressing HT(3 (~ 16%) (Example A, 
herein). 

SDS-PAGE analysis showed that the An315 fraction consisted almost entirely of 
HTp-Ll 1-a, along with a small amount (<10%) of native phycocyanin a and p subunits in a 
-1:1 ratio. This finding suggests that the fusion construct interacts relatively normally with 

10 unmodified phycocyanin in the cell. Such interaction to form higher assemblies is supported 
by analysis of Anabaena sp. PCC7120(pBS315) phycobilisomes, which were indeed found to 
contain HTp-Ll 1-a. Much more of the fusion protein was found in the phycobilisome 
substructures (rods and hexamers/trimers) than in intact phycobilisomes, suggesting that 
phycobilisomes with a higher content of HTp-Ll 1-a may be less stable during the 

15 preparation procedure. The high amount of HTfWLl 1 -a relative to native phycobiliproteins in 
the hexamer/trimer fraction also suggests that the construct is turned over more slowly in the 
cell. In this context, it is noteworthy that the whole-cell phycobiliprotein fluorescence 
emission of Anabaena sp. PCC7120(pBS315) was twice as high as that of strains expressing 
His-tagged aor|5 subunit, and the growth media of this strain often turned blue, the color of 

20 phycocyanin. 

On SEC-HPLC the An315 preparation fractionated into hexamers and trimers only, 
with no monomers observed (Table 5). This result parallels those obtained previously with 
Anl68 (HTa:p) and An262 (a:HTP), where the a and p subunits are not covalently linked 
(Example A, herein). The trimer component, (HTp-Ll l-cc)3, had a ^nmx at ^22 run at >10" 6 

25 M protein. (HTp-Ll l-a)3 dissociated to monomers at very low protein concentrations, with 
absorption maxima shifting blue to as low as 615 nm (Table 6), a behavior characteristic of 
native phycocyanin and of (HTa:p)3 and (a:HTp)3 holoproteins. At high dilution, HTp-Ll 1- 
a has an A max -A360 nm of 6.2, similar to the values observed for monomeric wild-type and 
His-tagged a:p phycocyanins (Table 6; Example A, herein). The sfor (HTp-Ll l-a)3 at >1 0" 6 

30 M protein concentration was determined to be 900,000 M" 1 cm" 1 . The fluorescence emission 
spectrum for 560 nm excitation of HTp-Ll 1-a, with Aat 643 nm and a Of of 0.21, were 
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virtually identical to those of wild-type phycocyanin. The fluorescence excitation spectrum 
for 655 ran emission corresponded well with the absorbance spectrum (Table 6). The above 
data show that the HTp-Ll 1-a fusion phycocyanin has properties virtually identical to those 
of the wild-type and His-tagged a:P holo-proteins, and indicate that HTp-Ll 1-a expressed in 
Anabaena sp. has a full complement of bilins, i.e., one PCB per a and two per p subunit. 

Mass spectral analysis of the SEC-HPLC trimer fraction, (HTp-Ll I-0O3, gave a value 
of 40,993.2±6.8 which corresponds well to the theoretical value (41,1 18.8 Dal) of the 
hdloprotein HTp-Ll 1-a with the initial Met residue (131.1 Dal) removed. The mass spectral 
analysis not only confirms the full posttranslational chromophorylation on both the a and P 
subunit moieties, but also indicates presence of the posttranslational methylation on the y- 
amino group of the Asn 72 residue of the p subunit domain (51,52). 

Expression of His-tagged phycocyanin a-j3 fusion protein in Anabaena sp. - Like 
HTp-Lll-a apoprotein (Ec315), HTa-Ll 1-P (Ec320) was expressed in E. coli in very high 
yield in soluble form. The phenotype and growth rate of cultures of Anabaena sp. 
PCC7120(pBS320) expressing HTa-Ll 1-P were similar to those of strains expressing only 
His-tagged a or P subunits. The whole-cell absorbance spectrum of Anabaena sp. 
PCC7120(pBS320) was similar to that of the wild type but with a slightly higher and blue- 
shifted contribution from phycocyanin, and about 50% higher whole-cell phycobiliprotein 
fluorescence was observed. 

Like HTp-Ll 1-a (An315), HTa-Ll 1-P (An320) was also expressed at very high yield 
in A nabaena sp. 5 often accounting for > 32% of total cellular phycobiliproteins. SDS-PAGE 
showed that affinity-purified An320 proteins were almost entirely HTa-Ll 1-p, with a small 
amount of copurified native phycocyanin a and p subunits. The ratio of copurified a to p 
subunits was ^ 2, suggesting a reduced affinity of HTa-Ll 1-p for native phycocyanin p 
subunits. 

On SEC-HPLC, the An320 fraction separates into three components: monomers, 
trimers, and hexamers. It is noteworthy that native C-phycocyanin under the same 
chromatographic conditions was mainly trimeric with very little monomer, whereas the HTa- 
Ll 1-p preparation was a - 1:1 mixture of trimers and monomers (Table 5). The theoretical 
molecular weights from these components are 13, 19, and 9% higher, respectively, than the 
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values observed (Table 5). These discrepancies indicate that the radius of gyration of HTa- 
Ll 1 -p is smaller than that of the wild-type phycocyanin ccp monomer. 

At >10" 6 M protein concentration, the SEC-HPLC trimer and monomer fractions of 
HTa-Ll 1-p had X max at 608 and 605 ran, respectively. This likely accounts for the blue- 
shifted phycobiliprotein contribution to the whole-cell absorbance spectrum of Anabaena sp. 
PCC7120(pBS320). Like wild-type phycocyanin, the trimer fraction showed a blue shift in its 
absorbance spectrum down to 605 nm with decreasing protein concentration, indicative of 
dissociation of the trimers to monomers. At high dilution, HTa-Ll 1-P has an A ma x:A360 nm 
of 5.2, substantially lower than the value of 6.4 observed for monomeric wild-type and 
HTa:p phycocyanins (Table 6; Example A). 

The unusual characteristics exhibited by the HTa-Ll 1-P protein purified from 
Anabaena sp. were similar to those of apo-HTa:p (Example A, herein). The trimer and 
monomer SEC fractions of HTa-Ll 1-P were therefore compared to the HTp-Ll 1-a holo- 
protein (both fusion proteins have identical amino acid composition). When denatured in acid 
urea, the trimeric fraction of HTa-Ll 1-p had PCB absorbance about midway between three : 
PCBs per protein (holoprotein) and two PCBs per protein, while the monomeric fraction gave 
absorbance very close to that of two PCBs per protein. An interpretation of that result is that 
the a domain of HTa-Ll 1-P fusion protein is not fully chromophorylated. Only about 50% of 
the trimeric HTa-Ll 1-p proteins had PCB in the a domain, while most monomeric HTa- 
Ll 1-p proteins had no PCB addition in the a domain. Because of the nearly equal distribution 
between trimers and.mqnorners, fewer than 30% of the a subunit domains of affinity-purified 
HTa-Ll 1-p proteins carry bilin. It is noteworthy that apo-HTa:holo-p and apo-a:holo-HTp 
proteins also run mostly as monomers on SEC-HPLC, with apparent molecular weights 
substantially smaller than theoretical calculations (Example A, herein). The fact that most of 
the HTa-Ll 1-p constructs have an apo-a domain could also explain why more native a than 
native P phycocyanin subunits were copurified with HTa-Ll 1-p, since the a-Cys 84 bilin is 
involved in the a:p interaction and in stable monomer packing (55, 62) 

Electrospray mass spectral analysis of the SEC-HPLC monomer fraction of HTa-Ll 1- 
P gave only one major peak with a value of 40,327.8±5.7, corresponding well with the value 
of 40,331.7 daltons, obtained by subtracting from 41,1 18.8 Dal (molecular weight of holo- 
HTa-Lll-P) the values 13L1 (initial Met residue), 57.0 Dal (the C-terminal or the second 
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residue from the N-terminus, Gly), 585.0 (one PCB), and 14.0 (one methyl group). This not 
only confirms the incomplete bilin addition (likely on the a domain) of HToc-Ll 1-P, but also 
suggests impaired posttranslational rnethylation on the y«amino group of Asn 72 of the p 
domain (51, 52). The latter is unexpected because HTp (Example A, herein) and HT(3-L1 1-ot 
(see above) purified from Anabaena sp. are both fully methylated on (3- Asn 72 . 

The SEC-HPLC trimer fraction of HTa-Ll 1-P (at >10~ 6 M) had a eof 677,000 M" 1 
cm" 1 The fluorescence emission spectrum for 560 nm excitation, with A,at 643 nm and a <l>f 
of 0.22, was virtually identical to that of wild-type phycocyanin. The fluorescence excitation 
spectrum for 655 nm emission corresponded reasonably well with the absorbance spectrum, 
with a slight shift in the maximum to 613 nm, likely due to the small amount of native 
phycocyanin in the preparation (Table 6). The SEC-HPLC monomer fraction of HTa-Ll 1-P 
had spectroscopic properties similar to those of apo-HTa:holo-p monomers (Example A). 

Effect of shortening the linker - A PCR-generated mutant, HTa-L9-p, with the linker 
Ala-(Gly)4-Ser-(Gly)3, was isolated in the course of producing the HTa-Ll 1-P construct, 
where LI 1 is Ala-(Gly)4-Ser-Gly-Ser-(Gly)3. In other respects the two constructs were 
identical. Like the HTa-Ll 1-p apoprotein (Ec320), the HTa-L9-p apoprotein (Ec319) was 
expressed in E. coli in high yield. HTa-L9-p (An319) isolated from Anabaena sp. exhibited 
the same, anomalous spectroscopic properties described above for the HTa-Ll 1-P construct. 
Analysis of the HTa-L9-P protein by SEC-HPLC showed trimers and monomers in a ratio of 
1 :4, significantly lower than the ratio of 1 : 1 observed for HTa-Ll 1-p. Since the extent of 
chrcniophorylation of trimer and monomer fractions of HTa-L9-P was similar to those of 
respective fractions of HTa-Ll 1-p, the reduced trimer formation by HTa-L9-p may be a 
result of conformation change induced by the shorter linker between the subunits. 

Expression of phycocyanin fusion proteins incorporating oligomerization domains - 
The dissociation of phycocyanins to the monomer at low protein concentration is highly 
undesirable when these proteins are used as fluorescent tags^ particularly because of the 
resultant decrease in the number of PCB chromophores per tag. The finding that recombinant 
phycocyanin subunits with a 24-residue extension at the N-terminus fold and assemble in the 
same manner as native phycocyanin subunits (Example A, herein) suggested the introduction 
of the GCN4 oligomerization domains (60) at the subunit N-tennini to produce stable 
phycocyanin oligomers. 
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Expression of phycocyanin a subunit fused at the N-terminus to the trimerization domain 
GCN4-pII - The 33-residue peptide GCN4-pII forms homotrimeric parallel coiled coils with 
Kd <10" 9 M (60). Plasmid pBS314 (encoding HT-pII-a) was constructed to express a GCN4- 
pII-CpcA fusion (Table 4). 
5 When HT-pII-a was expressed in E.coli, the recombinant protein was found almost 

entirely in inclusion bodies. Similar results were observed whether the induction was at 30 or 
37 °C. This behavior is in sharp contrast to that seen with HTa [Ecl68; (Example A, 
herein)], where most of the recombinant protein remains soluble. A likely explanation is that 
trimerization of the GCN4-pU in the fusion protein is faster than the folding of the 
10 phycocyanin subunit domain and interferes with folding of the latter by promoting random 
aggregation of the partially folded subunit domains. 

Cultures of Anabaena sp. PCC7 1 20(pBS3 1 4) expressing HT-pH-cc showed no 
negative phenotype, but had a bluer appearance than the wild type. Whole-cell absorbance 
spectra of Anabaena sp. PCC7120(pBS3 14) showed a higher ratio of 
15 phycocyaninrchlorophyll a. Whole-cell fluorescence was -70% higher than that of the wild 
type or cells expressing HTa. Sucrose density gradient fractionation of Anabaena sp. 
PCC7 120(pBS3 1 4) phycobilisome preparations showed that HT-pII-a represented a much 
higher proportion of the total phycocyanin in the trimer/hexamer fraction than in the intact 
phycobilisome fraction. 

20 Over 40% of the phycobiliprotein in Anabaena sp. PCC7 120(pBS3 1 4) cell lysate 

supernatant bound to the Ni 2+ -NTA column and eluted with 200 mM imidazole. This was the 
highest relative level of recombinant phycobiliprotein of any of the His-tagged constructs we 
have expressed in the course of these studies. An3 14 bound much more tightly to the Ni 2+ - 
NTA column than His-tagged phycocyanin constructs not containing oligomerization 

25 domains. The tightness of binding was evidenced by the slow desorption of the recombinant 
protein from the Ni 2 +-NTA resin and the larger volume of imidazole buffer required for 
elution. Such behavior is to be anticipated from a molecule in which three polypeptides, each 
with a 6xHis tag, are trimerized through the GCN4-pH coiled coil. SDS-PAGE analysis 
showed that the An3 1 4 phycobiliprotein fraction was a stoichiometric HT-pII-a:p holoprotein 

30 complex: Chromatography on SEC-HPLC showed that the HT-pII-a:P preparation was >75% 
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trimer with some higher molecular weight component s), presumably hexamer. Monomers 
were not detected. 

the spectroscopic properties of (HT-pII-a:p) 3 are compared with those of the native 
C-phycocyanin trimer, (a:P) 3 in Table 6. The trimerized construct has significantly higher 
A max :A3 60m n ratio than the native (a:P) 3 and a higher <D F . Moreover, the absorbance spectrum 
of (HT-pn-a:P) 3 was unchanged at low protein concentrations where native phycocyanin, 
HTct/.p, and a:HTp were monomeric with significantly blue-shifted absorbance maxima and 
decreased A max : A 36 o nm ratios (Table 6; Example A, herein). Fluorescence polarization 
measurements provided an independent assessment of the stability of (HT-pII-a:p) 3 at low 
protein concentrations. Iix trimeric phycocyanin, the fluorescence polarization is very low 
because of rapid energy transfer among the nine phycocyanobilin chromophores. In contrast, 
in the phycocyanin monomer, the three PCBs are well separated and depolarization through 
energy transfer is minimized. Our data show that at high protein concentrations, where 
various phycocyanin preparations are trimeric, their fluorescence polarization is similar and 
low, with values of 0.035 to 0.070. The fluorescence polarization of HTa:p and cc:HTp, like 
that of native phycocyanin, rose sharply at very low protein concentrations, indicative of 
trimer dissociation, whereas that of (HT-pH-a:p) 3 was independent of protein concentration. 

Expression of phycocyanin a subunit fused at the N-terminus to the tetramerization 
domain GCN4-pLI - The 33-residue peptide GCN4-pLI forms very stable homotetrameric 
parallel coiled coils (60). Several plasmids were constructed to express GCN4-pLI-cc fusion 
proteins (Table 4), including plasmids pBS321 (encoding HT-pLI-a) and pBS323 (encoding 
HT-Strep2-pLI-a). The 10-residue Strep2 tag was incorporated into the HT-Strep2-pLI-a to 
test whether it would confer streptavidin-binding specificity on the construct. 

When expressed in E. coli, GCN4-pLI-a fusion proteins, Ec321 (HT-pLI-a) and 
Ec323 (HT-Strep2-pLI-a), were mostly found in inclusion bodies. In contrast, when 
expressed vaAnabaena sp. the fusion proteins An321 and An323 remained soluble. The 
yields of these His-tagged phycobiliproteins from this strain were high, reaching > 20% of 
total phycobiliprotein. As described above for the GCN4-pII-cx fusion proteins, the GCN4- 
pLI-cc fusion protein (An321) bound very tightly to the Ni 2+ -NTA resin. SDS-PAGE analysis 
showed that An321 protein fraction purified by IMAC had the composition of HT-pLI-a*.p. 
Analysis by SEC-HPLC showed two components, a small amount of tetramer, (HT-pLI- 

38 



WO 01/46395 PCT7US00/35028 

a:p) 4 , and the balance as a much larger component. The elution position of the latter lay 
outside the range of calibration of the size standards for the column, but extrapolation of the 
calibration curve allowed an estimate of the size of the larger component. The calculated 
molecular weight was consistent with that of a trimer of tetramers (Table 5). 

Presumably, such a trimer would be formed by the interaction of three (HT-pLI-a:p) 4 
assemblies through one of their phycocyanin a:p domains (while the other three a:p domains 
forming a trimer) in a manner analogous to that leading to the formation of (HT-pn-a:p) 3 . In 
such a "trimer of tetramers", all a:p monomers are within phycocyanin trimers. Tetramers, 
each held together by a GCN4-pLI domain, would be the expected products of dissociation of 
such a "trimer of tetramers" at very low protein concentration. In accord with this expectation, 
the absorbance spectrum of HT-pLI-a:p at low protein concentration showed a 7^ max of 621 
nm and an A^zAaeonm ratio of 7.5 (Table 6), values very similar to those obtained for (HT- 

pII-a:P)3 (see above). 

HT-Strep2-pLI-a:p (An323) exhibited physical and spectroscopic properties identical 
to those of HT-pLI-a:P (An321) (Table 6), showing that the presence of the Strep2 sequence 
does not interfere with tetramerization by the GCN4-pLI domain. 

Expression ofsubunit-fusion phycocyanins fused at the N-terminus to the 
trimerization domain GCN4-pII - As described above, His-tagged P-a subunit-fusion 
phycocyanin (An315) has spectroscopic properties identical to those of HTa:pand a:HTp. 
Like those proteins, however, HT-P-L1 1-a also dissociates to monomers at high dilution. To. 
increase utility of the HT-P-L1 1-a fusion phycocyanin as a fluorescent label, we fused the 
GCN4-pE domain to this construct to enhance trimer stability. 

Plasmid pBS358 encodes HT-pII-p-Ll 1-a (Table 4). When expressed in E. colU HT- 
pII-P-Ll 1-a apoprotein was found mostly in inclusion bodies, but when expressed in 
Anabaena sp. the HT-pII-P-Ll 1-a protein remained soluble. Cultures of Anabaena sp. 
PCC7120(pBS358) had phenotypes very similar to those of the Anabaena sp. 
PCC7120(pBS315) (see above), including the slower growth, "bluer" color, and increased 
whole-cell fluorescence. Analysis of phycobilisome preparations from this strain showed that 
HT-pII-p-Ll 1-a was incorporated into the phycobilisomes. The 6xHis-tagged 
phycobiliprotein fraction (An358) purified by IMAC represented >22% of total cellular 
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phycobiliproteins. As noted above for An314 (HT-pII-a:p), the An358 fraction also bound 
very tightly to the Ni2+-NTA resin. 

SDS-PAGE analysis of the An353 fraction showed that it consisted >90% of HT-pII- 
p-Lll-a along with a small amount of copurified wild-type phycocyanin a and p subunits (in 
a ratio of 1:1). Analysis by SEC-HPLC at protein concentrations <500 ^tg/ml, showed that 
HT-pII-p-Ll 1-a preparation was largely trimeric with a small amount of a larger component 
(Table 5). Comparison of the spectroscopic properties of (HT-pII-p-Ll l-)3, purified by SEC- 
HPLC, with those of native C-phycocyanin trimer (aP)3, showed no significant differences 
(Table 6). The absorbance spectrum of (HT-pII-b-Ll l-a)3 did not change with dilution to 
very low protein concentration (Table 6), indicating that the GCN4-pII domain prevented 
dissociation of the trimer. 

The a-p fusion constructs purified from Anabaena sp., HTa-L9-p and HTa-Ll 1-P, 
preferentially form monomers even at micromolar protein concentrations (see above). The 
similar construct incorporating the GCN4-pII domain, HT-pII-a-Ll 1-P, is encoded by 
plasmid pBS362 (Table 4). The HT-pII-a-Ll 1-P protein purified from Anabaena sp., An362, 
was found to be largely trimeric (84%) with the remainder hexameric (Table 5). No monomer 
was observed in the SEC-HPLC analysis. It is evident that the trimeric state of HT-pH-cx- 
Ll 1-P depends on trimerization of the GCN4-pII domain. Although remaining trimeric at low 
protein concentrations, the HT-pII-a-Ll 1-P protein had spectroscopic properties similar to 
those of HTa-Ll 1-P protein (Table 6), including the incomplete chromophorylation of the a 
domain. 

Expression of recombinant phycocyanins with a tag conferring biospecificity - The 
constructs described above successfully address the requirement that the phycocyanin 
constructs maintain their oligomeric state at very low proteins concentrations. Next, we 
examine the introduction of an additional domain into such constructs, one which confers 
biospecific recognition. As a test case, we introduced a tag which binds specifically to 
streptavidin. Recombinant phycocyanins containing such a tag are immediately useful as 
reagents in the many applications of the widely used streptavidin-biotin methodologies (63). 

Expression of Strep2-tagged phycocyanin constructs - The 10-residue Strep2 peptide 
forms complex with streptavidin with a K D of 7.2xl0" 5 M (57). It is believed that the Step2 
tag retains its affinity for streptavidin when fused either to an N- or C-terminus of a protein 
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(57). Plasmids pBS327 and pBS323 were constructed to encode HT-Strep2-a and HT-Strep2- 
pLI-a 5 respectively (Table 4). E. coli and Anabaena strains expressing either fusion protein 
were phenotypically identical to those expressing the corresponding non-Strep2 -tagged 
' proteins. Affinity-purified HT-Strep2-a:p (An327) and HT-Strep2-pLI-a:p (An323) proteins 
5 also had spectroscopic and aggregation properties very similar to corresponding non-Strep2- 
tagged proteins HTcr.p (Anl68) and HT-pLI-a:{3 (An32 1 ) (Tables 5 and 6). 

When blotted on a membrane, HT-Sfrep2-cc (Ec327 and An327 and HT-Strep2-pLI-ot . 
(Ec323 and An323) were readily detected by probing with streptavidin-alkaline phosphatase 
conjugate, indicating that the Strep2 peptide sandwiched between the 6xHis tag and the 

10 GCN4-pLI domain still retains its affinity for streptavidin. When streptavidin-coated agarose 
beads were used as target, HT-Strep2-a:p holoprotein was largely removed from the beads on 
washing, a result anticipated from the low affinity of a single Strep2 tag for streptavidin [Kd 
- 7.2xl0- 5 M; (57)]. The HT-Strep2-pLI-a:(5 holoprotein, with its multiple Strep2 tags, 
bound much more strongly to the streptavidin-coated beads. This experiment demonstrates 

15 that oligomeric phycobiliprotein fusions with multiple Strep2 tags show the appropriate target 
specificity in streptavidin-based fluorescence assays. 

Example C. Recombinant phycobiliprotein fusion proteins with carboxyl-tenninal tagging. 

We present here the in vivo production of biotinylated phycocyanin constructs that are 
readily usable in the many well-developed biotin/avidin applications (63). Materials and 

20 methods essentially as described in Example A or B are not restated. 

Construction of expression plasmids - Plasmids coding for relevant protein fusion 
constructs are listed in Table 7. The expression vector pBS150v [(Example A); GenBank 
Accession number AF 177932] was used to make plasmid pBS339v (4,677 bp) for expression 
of His-tagged proteins capable of being biotinylated in E. coli. The 7 codons between the 

25 6xHis tag and the TEV site in pBS150v were replaced with the 21 codons specifying the BTN 
tag. The 13-residue sequence of the BTN tag was derived from the consensus sequence that 
can be biotinylated in E. coli (64). An Asull restriction site was designed into the BTN 
sequence as a signature site to facilitate PCR product characterization and subsequent 
recognition of the BTN tag sequence. The Ndel-cpcA-HindlU fragment from pBS 1 85 

30 (Example A) was inserted between Ndel and HindUl sites of pBS339v 5 giving plasmid 

pBS329v (5,138 bp) encoding 6xHis- and BTN-tagged phycocyanin a subunit (Table 7). The 
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BTN tag is positioned on the carboxyl-tenriinal side of the 6xHis tag, so that any recombinant . 
molecules that have lost the biospecificity tag through proteolysis are not retained on the 
IMAC column. The Ndel-glbN-Hindm fragment from plasmid pGlbN (66) was similarly 
cloned to give plasmid pBS355, encoding 6xHis- and BTN-tagged cyanoglobin (Table 7). 
5 The same Ndel-Hindm fragment cloned into pBS150v gave plasmid pBS121v (5,005 bp) 
encoding 6xHis-tagged cyanoglobin (Table 7). 

The cloning vector pBS370v (4,589 bp) for expression of C-terminal Strep tagged 
proteins (Table 7) was derived from the cloning vector pBS 1 52v [(Example A); GenBank 
accession No. AF1 77933] using inverse PCR. The 45-bp HindUL-Bgli fragment at the end of 
10 the multiple cloning sites was replaced with the 68-bp HindlU-Strep-Bgn. sequence. An Ehel 
site was engineered into the Strep tag coding sequence (56) as a signature site. Phycocyanin 
genes, in the form of Ndel-Xmril fragments from cognate plasmids (Example B) were inserted 
between Ndel and XmnI sites of pBS370v to generate phycocyanin-Strep tag fusions. The 
expression cloning vector pBS350v, an enhanced version of pBS152v, was created by 
15 replacing bp 3,801 to 4,070 of pBS152v with the 333-bp sequence described above. 

DNA fragment encoding the 1 14-AA C-terminal portion of the Anabaena sp. 
PCC7120 BCCP protein (BCCP1 14) was amplified from Anabaena genomic DNA. The 0.36- 
kb PCR fragment was digested with EcoT22l and cloned into the £coT22I site of pBS350v, 
giving plasmid pBS344v (4,977 bp). Again, phycocyanin genes, in the form of Ndel-Xmnl 
20 fragments from cognate plasmids (Example B), were inserted between Ndel and XmnI sites of 
pBS344v to generate phycocyanin-BCCP 114 fusion constructs with the BTN flexible linker. 
The 20-residue flexible linker between the two moieties was designed to (a) bear a thrombin 
recognition and cleavage site to allow separation of the fusion partners if so desired, and (b) 
be sufficiently long to allow packing of BCCP1 14 on the outside of the rod substructures 
25 after the fusion protein is assembled in phycobilisomes (Example A, Example B). The 

GCN4-pII trimerization domain was introduced via modular cloning (Example B) between 
the 6xHis tag and the TEV protease site N 1 of the phycocyanin genes. 

Expression of BTN-tagged fusion proteins - The 13-residue BTN tag has the 
consensus sequence of peptides found to be biotinylated in E. coli (64). Covalent attachment 
30 of a biotin to the Lys residue is apparently catalyzed by the biotin ligase encoded by the birA 
gene (65, 67). The BTN tag was fused to the N-terminus of the phycocyanin a subunit in an 
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attempt to produce in vivo biotinylated phycocyanin. When the HT-BTN-a phycocyanin 
subunit, encoded by plasmid pBS329, was expressed in E. coli, the purified Ec329 protein 
was shown biotinylated in Western analysis. The same protein produced in Anabaena sp. 
PCC7120 (An329) 5 however, was very poorly biotinylated, if at all, even when the culture 
5 medium had been supplemented with as much as 500 \xM biotin. Two other assays for the 
presence of biotin on An329, competition with HABA {2-[(4'-hydroxyphenyl)-azo]benzoic 
acid} for streptavidin in solution (68) and mobility shift in SEC-HPLC, indicated lack of 
biotinylation. Parallel results were obtained in a comparison of Ec317 (HT-BTN-pII-a) and 
An317 (HT-BTN-pn-cc:(3) proteins. 
10 Sucrose density gradient experiments showed that both HT-BTN-a (An329) and HT- 

BTN-pII-a (An3 1 7) proteins were assembled into the phycobilisomes. To test whether the 
lack of An3 17 and An329 biotinylation in vivo was due to the fact that the recombinant 
phycocyanins were quickly assembled into the phycobilisome upon biosynthesis and 
therefore not available to the biotin ligase, the cyanoglobin (GlbN) protein from 
1 5 cyanobacterium Nostoc commune (66) was used. This myoglobin-like monomelic 

hemoprotein has a specific subcellular location around the periphery of the cytosolic face of 
the cell membrane (69), but is not membrane-bound. His-tagged cyanoglobins HT-GlbN and 
HT-BTN-GlbN (encoded by plasmids pBS121 and pBS355, respectively; Table 7) expressed 
in both E. coli and Anabaena sp. were found to have absorbance spectra nearly identical to 
20 that described for the native GlbN (70). However, while the Ec355 protein was found to be, 
biotinylated, the An355 protein was not. The negative results for biotinylation in Anabaena 
sp. of all three different proteins tested, An317, An329, and An355, suggest that the BTN tag 
is not recognized by the Anabaena biotin ligase. 

Analysis of the IMAC-purified An329 proteins by SDS-PAGE incated the 
- 25 composition HT-BTN-a:f5. The An329 SEC-HPLC elution profile and spectroscopic 

characteristics were virtually identical to the non-BTN-tagged counterpart An 168 (HTa:(3; 
Table 8). The yield, however, was surprisingly low; An329 recovered by IMAC was <3% of 
total cellular phycobiliprotein, over 10-fold less than obtained for Anl68. This was also the 
case for An317 compared to its non-BTN-tagged counterpart An314 [HT-pII-a:P; (Example 
30 B)]. The yield of An355 was also dramatically lower than that of Anl21 . These results 
suggest that the BTN tag destabilizes the tag-bearing proteins in Anabaena cells. Since 
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An317 and An329 are assembled into the phycobilisomes, the latter may also be destabilized. 
Higher turnover of phycobilisomes and of An329 proteins in the cell may explain the 
observed phenotype of lower cellular phycobiliproteins in the strains Anabaena sp. 
PCC7120(pBS317) and Anabaena sp. PCC7120(pBS329). 

Expression of truncated Anabaena BCCP114 protein - Since the BTN tag was not 
biotinylated in Anabaena sp., naturally occurring biotinylated proteins were investigated. A 
truncated gene encoding the C-terminal 114 residues of the Anabaena BCCP protein, 
covering the biotinylation domain (corresponding to the E. coli BCCP84 that is sufficient for 
enzyme-catalyzed biotinylation) and a large portion of the flexible linker, was amplified from 
the genome and cloned into pBS350, giving plasmid pBS344 (Table 7). Plasmid pBS344 
encodes the 6xHis-tagged Anabaena BCCP 1 14 with a long linker bearing the TEV and 
thrombin protease sites. 

E. coli cultures expressing the 6xHis-tagged Anabaena BCCP1 1 4 protein (Ec344) 
grew normally, and produced very soluble Ec344 protein at >10 mg per liter of culture grown 
and induced at 37 °C. The protein was shown biotinylated by Western analysis. Electrospray 
mass spectroscopy analysis suggested that about 20% of the Ec344 protein produced under 
these particular conditions was biotinylated. Growth and. induction at 30 °C for longer period 
of time was attempted to increase the percentage of biotinylated holoprotein fraction of 
purified Ec344. However under these conditions, the yield of affinity purified protein was 
greatly decreased, possibly as a result of lowered production and increased degradation of the 
Ec344 protein. 

Cultures of Anabaena sp. PCC7120(pBS344) expressing the An344 protein were 
fairly healthy, albeit having a slightly reduced level of total cellular phycobiliproteins. The 
yield of An344 protein, however, was quite low (usually < 1 "mg per liter of dense culture), 
possibly as a result of proteolytic clipping of the relatively long linker between the 6xHis tag 
and the BCCP1 14 domain. The An344 protein was shown to be biotinylated in Western 
analysis. Electrospray mass spectral analysis indicated that about 40% of the molecules were 
holo- (i.e., biotinylated) proteins, substantially higher than the fraction of those produced in 
E. coli. These results encouraged the study of C-terminal phycocyanin a and p fusions with 
BCCP 114. 
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Expression of phycocyanin p subunit-BCCPl 14 fusion proteins - Plasmid pBS353 
encodes the fusion protein HTp-BCCPl 14 (Table 7). Overexpression of Ec353 in E. coli gave 
high yield of the protein in soluble form. Since expression of Ec262 (HTP) under similar 
conditions leads to substantial formation of inclusion bodies (Example A), the BCCP1 14 
5 domain appears to enhance the solubility and folding of the fusion protein. The HT(3- 
BCCP1 14 preparation from E. coli was found biotinylated in Western analysis. 

Cultures of Anabaena sp. PCC7120(pBS353) generally had an unhealthy, yellowish 
green appearance, and the growth rate was at least 20% slower than that of the wild type even 
under high-light conditions. The yellowish green color of the cultures results from -50% 
10 reduction of cellular phycocyanin as indicated by whole-cell absorption spectra. Upon IPTG 
induction of overproduction of the HTp-BCCPl 14 protein, cultures in late-log growth 
frequently stopped growing, and further incubation of the culture often led to general cell 
lysis. This unexpected result, which was also observed with the HTa-BCCPl 14 construct 
(An351; see below), led to the abandonment of IPTG induction of Aitabaena sp. 
15 PCC7120(pBS353) cultures. Expression of the HTp-BCCPl 14 protein, therefore, was left to 
the leaky trc promoter (Example A), and generally resulted in a fairly low yield of An353 (< 
5% of total cellular phycobiliproteins). Nonetheless, very high cell densities could be 
obtained with these uninduced cultures. 

The HTp-BCCP 114 protein was incorporated into phycobilisomes, with no obvious 
20 destabilizing effects. On SDS-PAGE analysis, however, the affinity-purified An3 5 3, protein 
had a substantially larger amount of the HTp-BCCPl 14 subunit than the partner phycocyanin 
holo-ot subunit, indicating a lowered stability of a:HTP-BCCPl 14 as compared with that of 
d:HTp. Also, a small fraction of the HTp-BCCPl 14 preparation had lost the BCCP1 14 
moiety by proteolysis. 

25 On SEC-HPLC the An353 preparation separated into two major components. The 

larger component, with a retention time of about 12.4 min, had an apparent mass of 1 83.7 
kDal, somewhat larger than 162.2 kDal calculated for the trimer, (a:HTp-BCCPl 14) 3 . On 
SDS-PAGE this larger fraction had only phycocyanin a and HTp-BCCPl 14 subunits, in 
nearly 1 :1 ratio. Upon exposure to Zn 2+ , fluorescence from the HTP-BCCP1 14 band under 

30 UV illumination was about twice as bright as that from the phycocyanin holo-a subunit, 
suggesting full bilin content (two PCBs per P) of the recombinant phycocyanin p subunit. 
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The smaller component, eluted at -14.4 min, contained mostly HTp-BCCPl 14, with an 
apparent molecular weight corresponding to that of the homodimer, (HTp-BCCPl 14)2- While 
this result is in accord with the observation that phycocyanin HTp subunits can form stable 
homodimers (Example A), it also suggests that the HTp-BCCPl 14 subunits have a lowered 
5 affinity for the phycocyanin a subunit, and may lose some of the a subunits during affinity 
purification. 

Polypeptides in the HTp-BCCPl 14 preparation lacking the BCCP1 14 portion 
represent a higher proportion of the fraction containing (HTp-BCCPl 14)2 than of the 
(a:HTp-BCCPl 14)3 fraction. Such proteolytic removal of the BCCP1 14 domain of Ec353 

10 was not observed in E. coli. The degradation observed in the preparations from Anabaena 
cells may therefore be the result of the much longer time the fusion protein An353 remains 
inside Anabaena cells, and may also reflect higher activity of Anabaena sp. proteases towards 
the long, thrombin site-bearing linker. 

Spectroscopic properties of the trimeric fraction, (a:HTP-BCCPl 14) 3 , were similar to 

15 those of (ct:HTP)3 (Table 8). The slightly blue-shifted Xaax of (a:HTp-BCCP114) 3 and the 
lower A max :A360nm ratio may reflect incomplete bilin addition to the p subunit, with a 
consequent perturbed conformation and lowered trimer stability in a portion of the 
preparation. 

Western analysis showed that HTp-BCCPl 14 produced in Anabaena sp. was 
20 biotinylated. Binding experiments with monomelic avidin-coated beads were performed to 
further assess the utility of An353 proteins as labels. When avidin molecules were in excess, 
-30% of the An353 preparation could be immobilized on the beads and then specifically 
eluted, indicating a level of HTp-BCCPl 14 biotinylation in Anabaena sp. of about 30%. 
When an excess of An353 was used, saturation by the recombinant phycocyanin of nearly all 
25 of the biotin-binding sites on the avidin-coated beads was achieved. With appropriate 
excitation, the stained beads emitted brilliant red fluorescence. 

Expression of HTJ3-BCCP1 14 fusion protein bearing the GCN4-pII trimerization 
domain - Only about a half of the HTp-BCCPl 14 expressed in Anabaena retained the 
phycocyanin oc subunit during purification. We had noted in earlier studies (Example B) that 
30 phycocyanin fusion proteins with GCN4-pH trimerization domains remained trimeric even at 
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very low protein concentrations and contained both subunits in stoichiometric amounts. This 
is most likely attributable to the greatly increased local concentration of subunits. 

HT-pII-P-BCCPl 14, encoded by plasmid pBS359 (Table 7), was expressed at high 
level in E. coli. About 25% of Ec359 could be isolated in soluble form, much more than seen 
with pH-phycocyanin subunit fusion constructs (Example B), again showing the increase in 
solubility attributable to the presence of the BCCP1 14 domain (see above). 

Cultures of Anabaena sp. PCC7120(pBS359) were yellowish, similar to those of 
Anabaena sp. PCC7120(pBS353), with an even slower growth rate. However, such cultures 
grew to very high cell densities after IPTG induction. 

- The HT-pn-fJ-BCCPl 14 protein was assembled into phycobilisomes with no obvious 
destabilizing effect. In contrast to An353, affinity-purified An359 protein preparation had the 
HT-pII-p-BCCPl 14 subunit and the phycocyanin holo-a subunit in a 1 : 1 ratio. The yield of 
An359 was surprisingly low, generally <2% of total cellular phycobiliproteins. Almost no 
HT-pIL-p lacking the BCCP1 14 moiety was observed, suggesting that in the stable trimers, 
(a:HT-pII-p-BCCPl 14) 3 , the long linker between p and BCCP1 14 is shielded from 
proteases. 

On SEC-HPLC, the An359 protein preparation migrates as a component with an 
apparent mass of 475.2 kDal, significantly higher than the 350.3 kDal calculated for a 
hexamer, [(a:HT-pII-P-BCCPl 14)3]2. A phycocyanin hexamer is normally formed by face- 
to-face stacking of two trimers (42). A model of the (a:HT-pII-p-BCCPl 14)3 trimer reveals , 
that a hexamer could form by two trimers stacking on the BCCP1 14 face, with the BCCP1 14 
domains on the outside of the trimer rings. Such a hexamer would have a larger radius of 
gyration and exhibit a higher apparent mass on SEC. It should be noted that at very low 
protein concentration the hexamer is expected. to dissociate into trimers. Spectroscopic 
properties of (a:HTp-BCCP114)3 (Table 8), were very similar to those of GCN4-pII-bearing 
constructs such as (HT-pII-a:p)3. 

Western analysis showed that HT-pH-P-BCCPl 14 obtained from Anabaena sp. was 
biotinylated. The utility of this construct as a fluorescent label was explored in binding 
experiments with avidin-coated beads. With avidin in excess, about 75% of (a:HT-pII-p- 
BCCP1 14)3 w ^s immobilized on the beads and then specifically eluted off. The extent of 
biotinylation of the BCCP1 14 domain in HT-pII-P-BCCPl 14 proteins is presumably -30%, 
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similar to that seen in HTP-BCCP1 14 (An353). The higher binding percentage is anticipated, 
since only one of the three BCCP1 14 domains needs to be biotinylated for the entire trimef to 
bind to the beads. When an excess amount of (a:HT-pII-p-BCCPl 14)3 was used, the amount 
of a:HT-pII-P-BCCPl 14 monomer equivalents immobilized was nearly 2.5-fold that of 
5 monomeric avidins. Under phycocyanin excitation, (a:HT-pH-P-BCCPl 14)3~stained avidin- 
and streptavidin-coated beads were highly fluorescent. 

Expression of phycocyanin-BCCPl 14 fusion protein with covalently bridged a and f3 
subunits - Constructs with covalently bridged a and P subunits provide a way of ensuring 1:1 
a:P stoichiometry, as demonstrated earlier with the HTa-Ll 1-P construct (Example B). HTa- 

10 LI 1-P-BCCP1 14 encoded by plasmid pBS361 (Table 7) was expressed well in E. coli and 
acceptably in Anabaena sp. 

HTa-Ll 1-P-BCCP1 14 protein was assembled into the phycobilisome. As with HTa- 
Ll 1-p (Example B), such phycobilisomes were less stable. The affinity-purified An361 
fraction represented —10% of total cellular phycobiliprotein and consisted >90% of HTa-Ll 1- 

15 P-BCCP1 14. A small amount of native phycocyanin a and p subunits copurified with HTa- 
Ll 1-p-BCCPl 14, indicating that the a and p subunits in the fusion construct retained their 
ability to interact with native subunits. A small fraction of the HTa-Ll 1-P-BCCP 114 proteins 
had lost the BCCP1 14 portion by proteolysis. 

SEC-HPLC fractionation showed that the An361 preparation consisted of trimers and 

20 monomers in an -1 :1 ratio. Spectroscopic properties of the trimers, (HTa-Ll 1-p-BCCPl 14)3, 
were found very similar to those of (HTa-Ll l-p>3 (Table 8), indicating that carboxyl 
terminal attachment to BCCP1 14 did not perturb the phycocyanin subunit domains. In a 
related construct, involving a carboxyl terminal fusion of the a subunit, (HTa-Ll 1 -(3)3, bilin 
addition to the a subunit domain was shown to be very incomplete (Example B). Since 

25 (HTa-Ll l-p)3 and (HTa-Ll 1-p-BCCPl 14)3 have virtually identical spectroscopic properties 
(Table 8), bilin addition to the a subunit domain of (HTa-Ll 1-p-BCCPl 14)3 is presumably 
also incomplete. 

The HTa-Ll 1 -p-BCCPl 14 protein preparation from Anabaena sp. was shown to be 
biotinylated in Western analysis. In binding experiments with streptavidin- and avidin-coated 
30 beads, the An361 preparation gave results similar to those obtained with HTp-BCCPl 14 (see 
above), indicating -30% biotinylation of the BCCP domain of HTa-Ll 1-p-BCCPl 14. 
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Expression of phycocyanin-BCCPl 14 fusion protein with covalently bridged a and J3 
subunits and the GCN4~pII trimerization domain - Plasmid pBS365 was constructed to 
encode the HT-pII-a-Ll 1-P-BCCP1 14 (Table 7). Unlike Ec362 (HT-pII-cx-Ll 1-p) which is 
found almost entirely in inclusion bodies (Example B), Ec365 expressed in E. coli remained 
5 soluble, again indicating the solubility-promoting effect of the BCCP 114 domain. 

HT-pII-a-Lll-p-BCCP114, expressed in Anabaena sp. PCC7120(pBS365), was 
found to be assembled into the phycobilisome. The yield of affinity-purified An365 was >5% 
of total cellular phycobiliprotein. The An365 preparation consisted of >90% HT-pII-a-Ll 1- 
(3-BCCP1 14 with a small amount of copurifying native phycocyanin a and (3 subunits. 

10 Virtually no loss of the BCCP 1 14 moiety was observed. 

On SEC-HPLC, the An365 preparation was found to consist mostly of hexamers, with 
a small fraction of trimers. (HT-pE-a-Lll-p-BCCP 114)3 had spectroscopic properties similar 
to those of (HT-pII-a-Lll-p)3 (Table 8), again consistent with incomplete bilin addition to 
the phycocyanin a domain (Example B). 

15 The An365 from Anabaena sp. was shown to be biotinylated in Western analysis and 

gave results in binding experiments with monomeric avidin-coated beads quantitatively 
similar to those obtained with An359, (a:HT-pII-p-BCCPl 14)3. Streptavidin-coated beads 
stained with excess of An365 appeared blue and emitted brilliant red fluorescence under 
appropriate excitation. * 

20 Expression of phycocyanin asubunit-BCCPl 14 fusion proteins - Recombinant 

phycocyanin a subunits, with the amino-terminus fused to polypeptides of varying length, 
display unmodified bilin content and spectroscopic properties (Example A, Example B). As 
noted above, fusions at the carboxyl terminus of the a subumt interfered with bilin addition. 
To examine whether this behavior is specific to particular C-terminal fusions or is general, 

25 four phycocyanin a-BCCPl 14 fusion constructs analogous to the (3-BCCP fusions described 
above were prepared (Table 7). 

Expression of the four constructs, Ec351, Ec357, Ec360, and Ec364, in E. coli gave 
results very similar to those observed with the four corresponding phycocyanin P-BCCP1 14 
constructs (see above). Expression of the four a-BCCPl 14 constructs in Anabaena sp., 

30 however, gave very different results. 
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An351 (HTa-BCCPl 14) represented < 0.5% of total cellular phycobiliprotein. SDS- 
PAGE showed that the HTa-BCCPl 14 polypeptide had a very low bilin content, indicating 
that the BCCP1 14 extension on the a subunit greatly interferes with bilin addition. The low 
yield is likely attributable to the ease of proteolysis of the apo-HTa-BCCPl 14 polypeptide. 
The An351 preparation contained little phycocyanin P subunit, suggesting reduced affinity to 
the apo-HTct-BCCPl 14. The proteolytic cleavage of the long peptide linker between a and 
BCCP114 was also high in the apo-HTa-BCCP114 polypeptide. An351 preparations often 
contained a substantial amount of degradation product of HTa-BCCPl 14 in which the 
BCCP1 14 moiety had been clipped off. The extremely low yield of An351 also suggests that 
the protein is rapidly turned over in the cell. 

Results from expression of An357 (Table 7) were nearly identical to those from 
An351, except that the native phycocyanin p subunit was copurified with the (mostly apo-) 
HT-pII-a-BCCPl 14 subunit in nearly 1:1 ratio. Results from expression of An360 (HTp- 
Ll 1-a-BCCPl 14) and An364 (HT-pII-p-Ll 1-a-BCCPl 14) were very similar. The yield was 
-5% of total cellular phycobiliprotein, substantially higher than that obtained from An35 1 
and An357. A large percentage of the purified fusion protein, however, had lost the BCCP1 14 
moiety. These results showed that interference with bilin addition to the phycocyanin a- 
subunit domain was a feature common to all of these different BCCP1 14 fusion constructs. . 

Expression of phycocyanin a subunit-Strep tag fusion proteins - To address the 
question of the possible dependence of bilin addition to the phycocyanin a subunit on the size 
of the carboxyl-terminal fusion partner, such fusions were prepared with the 10 -residue Strep 
tag (Table 7). 

The yield of the His-tagged protein preparation, An386, from Anabaena sp. 
PCC7120(pBS386) expressing HTa-Strep was <1% of total cellular phycobiliprotein (in 
great contrast to >30% usually obtained from cells expressing HTa). On SEC-HPLC the 
An386 preparation runs almost entirely as monomer, with a very, small amount of trimers. 
The monomer peak contained HTa-Strep and native phycocyanin p in nearly 1 : 1 ratio. While 
the p subunit was the normal holoprotein, the HTa-Strep subunits were almost entirely apo- 
subunits. The apo-HTa-Strep appeared to have lowered affinity for the phycocyanin p 
subunit, as indicated by the observation that fractions collected from the trailing part of the 
monomer peak on SEC-HPLC were enriched in the apo-HTa-Strep subunit relative to the 
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holo-f5 subunit. The spectroscopic properties of the HTa-Strep:p monomer were very similar 
to those of the apo-HTa:holo-p monomer (Table 8; Example A), including the characteristic 
broad peak in the fluorescence excitation spectra. Thus, even a short extension on the 
carboxyl terminus of phycocyanin a subunit affects the posttranslational bilin addition. 

Expression of phycocyanin fusion protein with covalently bridged a and psubunits 
and the Strep tag -In a. final attempt to address the problem of incomplete bilin addition in 
carboxyl terminal fusions of the phycocyanin a-subunit, we examined behavior of HTp-Ll 1- 
a-Strep fusions (encoded by plasmid pBS394; Table 7). The yield of affinity-purified An394 
was -10% of total cellular phycobiliprotein. Unlike the An315 preparation (from cells 
expressing HTp-Ll 1-a) that consists mostly of trimers, the An394 proteins is mostly 
monomeric, with small amounts of trimeric and hexameric components. Proteins in the highei 
assembly states had a higher bilin content, with those in the hexamer fraction approaching 
three PCBs per HTp-Ll 1-a-Strep molecule, i.e., holoproteins. Proteins in the monomer 
fraction on average contained two PCBs per HTp-Ll 1-a-Strep molecule, most likely lacking 
the a subunit bilin. In total, <10% of the polypeptides in the An394 preparation carried bilin 
on the a subunit domain, a small but significant improvement over the HTa-Strep:p 
construct. Most An394 preparation could be immobilized on streptavidin-coated beads, 
showing that the Strep tag in the HTp-Ll 1-a-Strep molecules has retained its affinity for 
streptavidin and is available for binding. 

TABLE 1 

Properties of Anabaena sp. PCC7120 and derivative strains and of expression vectors 
carried by these strains relevant to the production of His-tagged C-phycocyanin a and P 



subunits 



Strain and vector^ 



Strain genomic 



Polypeptide construct 



characteristics 



encoded on vector 



PCC7120(pBSl68) 
PCC7120(pBS262) 
B646(pBS168) 



Wild-type 



HTa 



Wild-type 

cpcBAC b has a transposon inserted 



HTP 



HTa 
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B646(pBS262) 
B64328(pBS168) 

5 

B64328(pBS262) 
B64407(pBS168) 

B64407(pBS262) 
10 PCC7120(pBS167) 
PCC7120(pBS162) 



a The expression vectors, specified in parentheses, are derived from pBS150v and pBS150 as 
15 described in the text. 

b Genes cpcB and cpcA encode the P and a subunits of C-phycocyanin and cpcC encodes a 
linker polypeptide involved in the assembly of trimers (aP)3 and hexamers [(aP)3h and their 
assembly into phycobilisomes (5, 6). 

20 

c cpcE and cpcF encode two subunits of a heterodimeric phycocyanin a subunit lyase required 
for the addition of phycocyanobilm to the apophycocyanin a subunit (1, 2-4). 

TABLE 2 

25 Molecular weights of His-tagged phycocyanins in different aggregation states, as determined 
by size exclusion chromatography and by analytical ultracentrifugation. 

Protein* SEC-HPLC AUC Major Aggregation Expected 

(kDa) (kDa) Components state MW (kDa) 

Native PC 211.7 ' a-p Hexamer 225.7 

124.7 Trimer 112.8 

44.5 Monomer 37.6 
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between the promoter and the cpcB 

open reading frame 

cpcBAC 

cpcE c inactivated by transposon 
insertion " 
cpcE 

cpcF c inactivated by transposon 

insertion 

cpcF 

Wild-type 

Wild-type 



HTP 
HTa 

HTp 
HTa 

HTp 

HTaAl2T 
HTP S46G . N76D 
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Anl68 


301.6 




HTa:P 


Hexamer 


243.0 




145.3 


125.9 




Trimer 


121.5 




49.2 


34.8 




Monomer 


.' 40.5 


An262 


136.9 


121.9 


a:HTp 


Trimer 


121.5 






4*3 4 




Monomer 


40.5 


A11I68-BAC 


289.3 




HTa:p 


Trimer 


243.0 










1\A fin OTTl PT 


121.5" 




17.3 




PIT™ 


Subunit monomer 


20.9 




50 3 




HTp 


Subunit homodimer 


45.0 


Anl68-E/F 


106.8 




Apo-HTa:0 


Trimer 


119.7 




29.8 






Monomer 


39.9 


An262-E/F 


54.1 




HTp 


Subunit homodimer 


45.0 




33.8 




Apo-a:HTp 


Monomer 


39.9 



a "-BAC" suffix indicates proteins isolated from cpcBAC mutant strains of Anabaena sp., and 
,f -E" and ,f -F ,! indicate proteins isolated from cpcE and cpcF mutant strains of Anabaena sp., 
respectively. 

TABLE 3 



Spectroscopic properties of His-tagged phycocyanins a . 



Sample 


Abs max 




Em max 


£m (M^cnr 1 ) 


<£>f 




(nm) 


(nm) 


(nm) 


(xl,000) 










Native PC (a:P>3 


615-618 


615 


644 


929 ±2.1 


0.27 ± 0.06 


(HTa:P) 3 


615-621 


618 


642 


921 ±36 


0.22 ± 0.05 


(a:HTP) 3 


615-619 


616 


642 


888 ±33 


0.23 ± 0.04 


HTa 


618-622 


615 


637 


109 ±2.5 


0.23 ± 0.02 


(HTP) 2 


605 


606 


639 


: 365± 13 


0.22 ± 0.03 


apo-HTa:P 


607 


606 


642 


' 182±1.4 


0.24 ± 0.00 


One protein- 


660 b 






35.40 


0 


bound PCB 


280 






14.85 ±0.1 





a Absorbance maxima (Abs max ) are for protein concentrations between 0.05 and 3 jaM. Lower 
concentrations yield blue-shifted spectra in all cases where a range of wavelengths is given. 
Excitation and emission maxima (Ex™* and Em max ), and quantum yields of fluorescence 
(Of) are for samples at ^0.05 jjM. Molar extinction coefficients (S m ) were determined for the 
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following complexes purified by SEC-HPLC : native PC, trimer fraction; HTa.p* trimer 
fraction of Anl68; a:HT(3, trimer fraction of An262; HTa, subunit monomer fraction of 
Anl68-BAC; HTp, subunit homodimer fraction of An262-BAC; apo-HTa:P; monomer 
fraction of Anl68-F. 

b Absorbance of protein-bound phycocyanobilin was measured with SEC-HPLC purified 
(HTcc:|5)3 and (oc:HT0)3 holoproteins, denatured in 8 M urea pH 2.0. The molar extinction 
coefficient at 660 nm is identical to that measured in 7.2 M urea pH 2.0, 9 mM DTT (35), 
while that at 280 nm was calculated by subtracting contributions from Tyr [S = 1,370 M" 1 
cm- 1 ; (36)] and Trp [S = 5,500 M _1 cm' 1 ; (37)] residues. 



TABLE 4 

Expression plasmids with specific functional domains in the encoded protein 



Plasmid a 


Affinity 


Biospec. 


Coiled coil 


Protease 


Core 




tag 


tag b 


domain 


site 


protein 


pBS311 


6xHis 




GCN4pH 


TEV 


LacZ a 


pBS314 


6xHis 




GCN4pn 


TEV 


CpcA 


pBS319 


6xHis 






TEV 


CpcA-L9-CpcB 


pBS320 


6xHis 

i 






TEV 


CpcA-Ll 1 -CpcB 


pBS362 


6xHis 




GCN4pH 


TEV 


CpcA-Lll-CpcB 


pBS315 


6xHis 






TEV 


CpcB-Lll-CpcA 


pBS358 


6xHis 




GCN4pn 


TEV 


CpcB-Lll-CpcA 


pBS283 




Strep2 




TEV 


LacZ a 


pBS342 


6xHis 


Strep2 




TEV 


LacZ a 


pBS327 


6xHis 


Strep2 • 




TEV 


CpcA 


pBS303 




Strep2 


GCN4pLI 


TEV 


LacZ" 


pBS309 


6xHis 


Strep2 


GCN4pLI 


TEV 


LacZ« 


pBS323 


6xHis 


Strep2 


GCN4pLI 


TEV 


CpcA 


pBS312 


6xHis 




GCN4pLI 


TEV 


LacZ a 
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a Like the parent plasmids pBS150v and pBS150, all plasmids listed have two versions: the 
smaller one, indicated in the text with a suffix "v", is more suitable for cloning manipulations, 
but is lacking the 3.7-kb pDUlHC fragment that enables autonomous plasmid replication in 
Anabaena sp. PCC7120 (Example A, herein). 

h The biospecificity tag is positioned on the carboxyl-terminal side of the 6xHis tag, so that 
any recombinant molecules that have lost the biospecificity tag through proteolysis are not 
retained on the IMAC column. The Strep2 tag is a 10-residue peptide with specific affinity 
for streptavidin. 

TABLE 5 



Determination of apparent molecular weight of components in recombinant phycocyanin 
preparations by SEC-HPLC a 



Protein 


Calc. mass 
(kDa) 


Percent 
area 


Assembly 
state b 


Theoretical 
mass (kDa) c 


Anl68 


49.2 


5 


Monomer 


40.5 


(HTa:P) 


145.3 


95 


Trimer 


121.5 


An327 


44.2 


4 


Monomer 


41.4 


(HT-Strep2-a:p) 


137.9 


94 


Trimer 


124.3 




275.4 


2 


".Hexamer 


248.7 


An314 


110.2 


79 


Trimer 


132.7 


(HT-pII-a:P) 


406.0 4 


21 


(Hexamer)2 


530.7 


An321 


232.8 


13 


Tetramer 


176.9 


(HT-pLI-a:p) 


499.5 d 


81 


(Tetramer)3 


530.7 


An323 


264.0 


6 


Tetramer 


182.7 


(HT-Strep2-pLI-a:p) 


429.3 d 


94 


(Tetramer)3 


548.2 
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10 



An319 


35.8 


79 


Monomer 


41.0 


CHTa-L9-B) 


99.3 


19 


Trimer 


122.9 




232.6 


2 


Hexamer 


245.8 


An320 


34.1 


49 


Monomer 


41,1 


(HTa-Lll-p) 


89.7 


46 


Trimer 


123.4 




213.6 


5 


Hexamer 


246.7 


An362 


128.9 


84 


Trimer 


134.5 


(HT-pH-a-Lll-P) 


304.7 d 


15 


Hexamer 


269.1 




1 07 9 


76 


Trimer 


123.4 


(HTp-Lll-a) 


235.7 


23 


Hexamer 


246.7 


An358 


115.6 


64 


Trimer 


. 134.5 


(HT-pn-P-Lll-a) 


587.2^ 


35 


(Hexamer)2 


538.0 



15 



20 



a For experimental conditions, see text. 

b By convention, the a: (3 heterodimer is referred to as a phycocyanin "monomer." 
c Values are for holo-proteins. 

d Apparent molecular weight outside the range defined by the calibration standards. 



TABLE 6 

Spectroscopic properties of phycocyanin constructs 2 



25 



30 



Sample b 


Initial 


^-max c 


Amax 1 


Ex m ax 


Ern m ax 


SM 


Of 




assembly state 


(nm) 


A360nm 


(nm) 


(nm) 


(M^cnr 1 ) 




Denatured 




280 








44,550 d 




PC 




660 


1.0 






106,200 


o 


Native PC 


(a:P) 3 


615-619 


6.4 


615 


644 


929,000 


0.27 


Anl68 


(HTa:P) 3 


615-621 


6.4 


618 


642 


921,000 


0.22 


An327 


(HT-Strep2- 


615-619 


6.4 


618 


642 


922,000 


0.23 




a:P) 3 














An262 


(a:HTP) 3 


615-619 


6.3 


616 


642 


888,000 


0.23 


An314 


(HT-pn-a:P) 3 


621-622 


7.5 


622 


642 


927,000 


0.39 
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An321 (HT-pLI- 621-623 7.5 621 643 1,231,000 0.28 
a:p) 4 

An323 (HT-Strep2- 621-623 7.2 621 644 1,240,000 0.27 
pLI-a:p)4 



An315 (HTp-Lll-a) 3 615-622 6.2 622 643 900,000 0.21 

An358 (HT-pH-P- 619-621 6.7 620 643 915,000 0.29 

Lll-a) 3 

5 An320 (HTa-Lll-P) 3 605-608 5.2 613 643 677,000 e 0.22 

An362 (HT-pII-a- 607-610 5.8 615 642 707,000 f 0.24 

LH-P)3 

3 Trimer or tetramer components obtained by SEC-HPLC were used for all 
measurements. A max : A36O nm values, excitation maxima (Ex max )» emission maxima 
10 (Em max ), and quantum yields of fluorescence (Of) are given for measurements at 
protein concentrations <;0.05 mM. 8m values are given for measurements on the 
components specified in column 2 at protein concentrations >1 fuM. 



b Properties of proteins not described in this study are taken from Example A, herein. 

c ^max values were determined at protein concentrations from 0.05 to 3 jaM. Where 
the spectra shift to the blue with decreasing protein concentration, the limits of X, m ax 
values are those measured at the lowest and highest concentrations, respectively. 

20 d Absorbance from the three protein-bound PCBs only, excluding contribution from Tyr and 
Tip residues (Example A, herein). 
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TABLE 7 

List of fusion constructs indicating order of domains 

Fusion construct" Plasmid 



6xHis-BTN-TEV-LacZ a P BS339 

6xHis-BTN-TEV-GpcA P BS329 

6xHis-BTN-pII-TEV-CpcA P BS317 

6xHis-BTN-TEV-GIbN p BS355 

6xHis-TEV-GlbN P BS121 



6xHis-TEV-TBN-BCCP114 p BS344 

6xHis-TEV-CpcB-TBN-BGCP114 p BS353 

6xHis-GCN4pII-TEV-CpcB-TBN-BCCP114 p BS359 

6xHis-TEV-CpcA-Lll-CpcB-TBN-BCCP114 p BS361 

15 6xHis-GCN4pn-TEV-CpcA-Lll-CpcB-TBN-BCCP114 p BS365 

6xHis-TEV-CpcA-TBN-BCCP114 p BS351 

6xHis-GCN4pn-TEV-CpcA-TBN-BCCPH4 p BS357 

6xHis-TEV-CpcB-Lll-CpcA-TBN-BCCP114 p BS360 

6xHis-GCN4pn-TEV-CpcB-Lll-CpcA-TBN-BCCP114 p BS364 



6xHis-TEV-Strep p BS370 
6xHis-TEV-CpcA-Strep p BS386 
6xHis-TEV-CpcB-Ll 1-CpcA-Strep p BS394 



25 a Functional domains in each construct are listed in order from N- to C-terminus, with 
particularly relevant domains in bold face. CpcA and CpcB correspond to the a and P 
subunits of phycocyanin, respectively. 

b All plasmids have two versions. The smaller one, indicated in the text with a suffix "v", is 
for cloning and expression in E. coli, and lacks the 3.7-kb pDUlHC fragment that enables 
30 autonomous plasmid replication in Anabaena sp. PCC7120 (Example A). 
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TABLE 8 



Spectroscopic properties of phycocyanin constructs 3 



Sample 


Initial assembly 


^*max 


Axnax 




Em m ax 


SjyiCM^cm" 1 ) 






state b 


(nm) c 


/A36O nm 


(nm) 


(nm) 






Denature 




280 


— 


— 


— 


44,550 <» 


0 


d 




660 


1.0 






106,200 


0 


PC 
















Native 




615-619 


6.4 


615 


644 


929,000 


0.27 


PC 
















An! 

Do 








61 R 


642 


921 000 


0.22 


. An329 


(HT-BTN-a:p) 3 


615-618 


6.4 


617 


642 


920,000 


0.22 


An386 


HTa-Strep:p 


605-609 


5.6 


607 


639 


186,000 


0.19 


An262 


(a:HTP) 3 


615-619 


6.3 


616 


642 


888,000 


"0'.'23" 


An353 


(a:HTP- 


611-616 


5.7 


612 


642 


896,000 


0.21 




BCCP 114) 3 














An314 


(HT-pH-a:p) 3 


621-622 


7.5 


622 


642 


927,000 


0.39 


An317 


(HT-BTN-pII- 


621-622 


7.3 


622 


644 


939,000 


0.31 




a:P) 3 














An359 


(a:HT-pII-p- 


619-622 


6.6 






A1 < AAA 






BCCP 114) 3 














An320 


(HTa-Lll-p)3 


605-608 


5.2 


/CIO 

613 


£ A O 

o4i 


aaa e 




An361 


(HTa-Lll-p- 


605-606 


5.3 


604 


637 


664,000 e 


0.23 




BCCP 114) 3 














An362 


(HT-pn-a-Lll- 


607-610 


5.8 


615 


642 


707,066? . 


0.24 




P)3 














An365 


(HT-pH-a-Lll- 


605-607 


5.5 


606 


641 


692,000 f 


0.27 




p-BCCP 114 ) 3 















a Properties of constructs not described in this studies are taken from the accompanying 
studies (Example A, Example B). 

b Affinity-purified proteins were futher purified by SEC-HPLC, and collected fractions in 50 
mM Na-phosphate pH 7.0 buffer ± 1 mM NaN3 were used for all measurements. A ma x-A360 
nm values, excitation maxima (Ex ma x)> emission maxima (Em m ax) 5 and quantum yields of 
fluorescence (Of) are given for measurements at protein concentrations ^0.05 /jM. £m values 
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are given for measurements on the components specified in column 2 at protein 
concentrations >1 piM. 

c ^max values were determined at protein concentrations from 0.05 to 3 uM. Where the 
spectra shift to the blue with decreasing protein concentration, the A-max values are those 
5 measured at the lowest and highest concentrations, respectively. 

d Absorbance from the three protein-bound PCBs only, corrected for contributions from Tyr 
arid Trp residues (Example A). . 

e The phycocyanin a subunit domain has an average PCB content of -0.5. 
f The phycocyanin a subunit domain has an average PCB content of -0.25. 

10 
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FURTHER EXAMPLES ' 
1. Phvcobiliproteins carrying the GCN4-pII or the GCN4-pLI coiled-coil oligomerization 
domain. These fusion proteins were found almost entirely in inclusion bodies in E. coli, but 
found incorporated in the phycobilisome in Andbaena sp. These constructs, as well as other 
examples discussed below, all show that proteins bearing the GCN4 oligomerization domains 
are found almost entirely in inclusion bodies when expressed in E. coli, but are incorporated 
in phycobilisomes when expressed in Andbaena sp. Upon phycobilisome dissociation, good 
yields of soluble, fluorescently tagged proteins are obtained. 
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2. Strep tavidin-phvcocvanin a subunit fusion proteins. The core streptavidin (stvC) gene 
used here encodes a StvC corresponding to residues 16 to 133 of the mature streptavidin. 
Recombinant StvC with a 24-residue N-terminal extension bearing a 6xHis tag and the TEV 
site, HT-StvC (encoded by plasmid pBS282), was expressed relatively poorly in E. coli, with 

5 about 80% of the proteins found in inclusion bodies. The StvC-CpcA fusion protein has an 
1 1-residue bridge linking the two moieties. HT-StvC-CpcA (encoded by plasmid pBS292) 
was produced at relatively high level in E. coli, but was found only in inclusion bodies. 

The fusion protein An292 was well expressed in Anabaena sp. Ni 2+ -NTA-purified 
soluble An292 protein accounted for >15% of total cellular phycobiliproteins. The An292 

10 protein was of HT-StvC-a:p in composition, ran on SEC-HPLC as tetramer (HT-StvC-a:P)4 . 
and trimer of tetramers [(HT-StvC-a:p)4]3, similar to the finding with An321 with 
phycocyanin a subunit bearing the tetramerization domain GCN4-pLI: (HT-pLI-a:p)4 and 
[(HT-pLI-a:P)4]3. An292 also had spectroscopic properties virtually identical to those of 
An321, making it an excellent fluorescent label. The An292 protein was assembled into 

15 phycobilisomes and also found to bind biotin. 

Like streptavidin tetramers, the (HT-StvC-a:p)4 tetramer was much more stable than 
the (HT-pLI-a:P)4 tetramer. On room temperature denaturing SDS-PAGE, (HT-StvC-a:p)4 
released only the p subunits and retained the (HT-StvC-a)4 aggregate. The. tetramer could be 
broken only by boiling the protein before loading on SDS-PAGE. Purified phycobilisomes 

20 displaying StvC had a tendency to aggregate out of solution, likely a direct result of inter- 
phycobilisome linkage mediated by tetramerization of the monomelic StvC domains ; . 
displayed on PBS. Usually within 48 hrs all such phycobilisomes precipitated out of solution. 

3. Protein A-phvcocvanin a subunit fusion proteins. 

Expression of truncated protein A (SpA) in the cytosol of E. coli has been shown to 
25 give very poor yield, with particularly severe proteolysis occuring to the hinge of domains A 
and B. In our construct, the truncated SpA protein [denoted SpA(DABC)] contains the hinge 
of domain E, domains D, A, and B in entirety, and domain C lacking a small portion of its 
hinge. Domains D, A, B and C all retain their IgG-binding structure. Plasmid pBS356 
encodes the recombinant SpA with a C-terminal 6xHis tag extending from the residual hinge 
30 region of domain C: SpA(DABC)-6xHis. The protein, Ec356, was obtained in extremely poor 
yield when expressed in E. coli, regardless whether the purification method was denaturing 
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(using guanidine HC1; see e.g., Example B, supra) or not, and almost no full-length Ec356 
protein was seen on SDS-PAQE. Recombinant protein A with engineered hinge regions has 
been described for better expression of the SpA in E.coli . Without such extensive 
reengineering, E. coli is not an appropriate organism for expression and production of 
recombinant protein A. . 

In experiments described here, the SpA(DABC) protein is fused at its C-terminus to 
the phycocyanin a subunit through a 24-residue linker bearing a 6xHis tag and a recognition 
and cleavage site for the specific TEV endoprotease. The fusion protein, SpA(DABC)-6xHis- 
TEV-CpcA, encoded by plasmid pBS349, was obtained in very poor yield when expressed in 
E.coli, likely a result of extensive proteolysis. Little full-length Ec349 protein was seen, with 
most of the Ni2+-NTA-purified protein having some or all of the IgG~binding domains 
missing. 

When expressed in Anabaena sp., the fusion protein An349 was found assembled into 
phycobilisomes with no apparent destabilizing effect on the light-harvesting complex. 
Cellular phycobiliprotein level relative to chlorophyll a, however, was about 20% lower, 
suggesting an elevated phycobiliprotein turnover. Nonetheless, growth rate of cultures under 
high-light intensity was not affected. Ni24--NTA-purified An349 proteins were obtained in 
relatively low yield of about 5% of total cellular phycobiliproteins. 

On SEC-HPLC, the An349 proteins fractionated into two components: >80% as the 
trimer (SpA-6xHis-a:p)3 and the rest being SpA-6xHis-a subunit. SDS-PAGE showed that 
the SpA moiety of the majority of the fusion protein had also been proteolyzed, giving full- 
length SpA(DABC)-6xHis-ct (49.5 kDal; ca. 10%), SpA(ABC)-6xHis-ot (41.5 kDal; ca. 
20%), and SpA(BC)-6xHis-a (35.5 kDal; ca. 70%). No proteolytic cleavage on domain B 
hinge [giving SpA(C)-6xHis-a] and domain C hinge (giving 6xHis-a) was observed. The 
lack of proteolytic cleavage on the domain B hinge is a clear indication of different protease 
activities in Anabaena sp. because that hinge region has been found to be the most susceptible 
to proteolytic cleavage in E. coli. The lack of clipping in the domain C hinge is likely due to 
its proximity to the phycobilisome, minimizing access by proteases. This hinge region is 
susceptible to proteolytic activities in Anabaena sp. when displayed farther away from the 
phycobilisome surface (see below). 
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SDS-PAGE also showed that the carrier phycocyanin a domain had the normal bilin 
content. The SEC-HPLC trimer (SpA-6xHis-a:B)3 fraction had spectroscopic properties very 
similar to those of (6xHis-a:B)3, with absorbance maximum at 616 nm, an A616 nm:A360 
inn ratio of 6.1, fluorescence excitation maximum at 618 nm (for 650 nm emission), and 
fluorescence emission maximum at 641 nm (for 560 nm excitation). 

An349 proteins were tested for their ability to bind IgG in ELISA. Serially diluted 
An349 protein was immobilized on a 96-well plate and then allowed to bind to mouse IgG- 
alkaline phosphatase conjugate. After thorough washing, the alkaline phosphatase activity 
was assayed by catalyzed color development. In such semi-quantitative assays, the SpA- 
6xHis-a:B fusion protein had about 30% IgG-binding activity as compared to commercially 
obtained protein A (Sigma Chemical Co.) with all five IgG-binding domains. 

Another SpA-phycocyanin a fusion protein was constructed to enhance the fusion 
protein's utility. This construct is similar to the SpA(DABC)-6xHis-TEV-CpcA construct 
described above, but has the 33-residue GCN4-pII trimerization domain inserted between the 
6xHis tag and the TEV site. This fusion protein, SpA(DABC)-6xHis-pII-TEV-CpcA, 
encoded by plasmid pBS354, was found only in inclusion bodies when expressed in E. coli. 
Unlike Ec349, about 10% of the Ec354 proteins affinity-purified from inclusion bodies was 
full-length [SpA(DABC)-6xHis-pH-TEV-CpcA], suggesting fast sequestering of the newly 
synthesized polypeptides into inclusion bodies due to bundling by the GCN4-pH domain. The 
majority of purified Ec354 proteins, however, were still missing some or all of the SpA IgG-: 
binding domains, again illustrating protein As susceptibility to E. coli proteases. 

When expressed xnAnabaena sp., the fusion protein An354 was found assembled into 
phycobilisomes with no apparent destabilizing effect on the complex. Interestingly, cells 
expressing An354 had a more normal phenotype than those expressing An349 (see above). 
Ni2+-NTA-purified An354 proteins were obtained in reasonable yield, accounting for >7% of 
total cellular phycobiliproteins. 

On SEC-HPLC, the An354 protein ran as a single peak with an apparent molecular 
weight corresponding to a hexamer, [(SpA-6xHis-pII-a:Ji)3]2. SDS-PAGE confirmed the 
composition as SpA-6xHis-pII-a:B, and showed that most of the SpA moiety was subject to 
proteolytic clipping at different hinge regions, giving full-length SpA(DABC)-6xHis-pII-a 
(53.2 kDal; ca. 10%), SpA(ABC)-6xHis-pII-a (45.2 kDal; ca. 20%), SpA(BC)-6xHis-pII-a 
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(39.2 kDal; ca. 50%), and 6xHis-pII-a (25.2 kDal; ca. 20%). The complete removal of the 
displayed SpA protein by cleavage in the domain C hinge is not seen with the An349 protein, 
and is likely a result of the SpA moiety in An354 being displayed farther away from the 
phycobilisome surface. As with An349, no proteolytic cleavage on domain B hinge [giving 
SpA(C)-6xHis-pII-a] was observed in Anabaena sp. Different fractions collected from the 
An354 SEC-HPLC peak had identical distributions of the fusion proteins missing various 
IgG-binding domains, consistent with the view that the fusion proteins are displayed on the 
phycobilisome as monomers (with respect to the GCN4-pII domain), and only form GCN4- 
pll-bundled stable trimers after cell lysis and phycobilisome dissociation. 

SDS-PAGE also showed that the carrier phycocyanin a domain had the normal level 
of phycocyanobilin. The SEC-HPLC hexamer [(SpA-6xHis-pH-a:B)3]2 fraction had 
spectroscopic properties very similar to those of (6xHis-pII-a:P)3 (see, e.g. Example C, 
supra), with an absorbance maximum at 621 nm, an A616 nm:A360 nm ratio of 7.5, 
fluorescence excitation maximum at 621 nm (for 650 nm emission), and fluorescence 
emission maximum at 642 nm (for 560 nm excitation). These favorable spectroscopic 
properties of such GCN4-pII-bundled stable trimers make the fusion protein a good 
fluorescent tag. ELISA also showed that An354 had IgG-binding affinity comparable to 
commercially obtained protein A. The increased IgG-binding activity of An354 in ELISA 
compared to An349 may be a result of more protein being immobilized on the 96-well plate, 
and/or cooperative binding of GCN4-pII-bundled SpA proteins. Such oligomerization- 
induced binding affinity increase has been observed (see e.g., Examples B and C, supra). 
4. Cyanoglobin fusion to phvcocvanin aorfi subunit . Cyanoglobin (GlbN) is a 15.7-kDal 
monomeric hemoprotein produced in the cyanobacterium Nostoc commune. This protein 
contains a non-covalently linked heme, and is well expressed as soluble protein in both E. 
coli (expression at 30°C and 37°C) and Anabaena sp. when bearing a 24-residue N-terminal 
extension containing a 6xHis tag and the TEV recognition and cleavage site (recombinant 
protein encoded by plasmid pBS121; Example C, supra). However, recombinant cyanoglobin 
with C-terminal extension (the stop codon is replaced by a Ser codon) was found unable to 
fold normally in E. coli. Cyanoglobin-phycocyanin subunit fusion constructs, GlbN-6xHis- 
TEV-CpcA (encoded by plasmid pBS190) and GlbN-6xHis-TEV-CpcB (encoded by plasmid 
pBS274), gave relatively poor yield and were found mostly in inclusion bodies when 
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expression was carried out at 37°C. At 30°C, the two fusion proteins had somewhat different 
behavior Although both were still fast degraded in E. coli (thus giving low yield), the GlbN- 
HTa fusion protein (Eel 90) was produced in a substantial amount as soluble protein, with 
apparently normal amount of heme and an absorbance spectrum identical to that of 6xHis- 
5 tagged GlbN (Ecl21). The GlbN-HTb fusion protein (Ec274), on the other hand, was still 
found mostly in inclusion bodies, with the small amount of soluble proteins purified largely 
as apoproteins (without heme). These results indicate that the C-terminal extensions have 
slowed the folding of GlbN in E. coli, and the cyanoglobin domain in the GlbN-HTb fusion 
protein appears to be rnisfolded. 

10 Anabaena cultures expressing either fusion proteins were healthy, showing no 

negative phenotypes at all. Both fusion proteins, Anl90 and An274, were found assembled 
into phycobilisomes with no destabilizing effect on the macrostructures, were purified in high 
yield (each accounting for >20% of total cellular phycobiliproteins) with cognate partner 
subunits in 1 :1 ratio, and found to have spectroscopic properties very similar to native 

15 phycocyanin. 

The cyanoglobin moiety in the fusion proteins, however, behaved quite differently. 
Using phycocyanin a subunit as the carrier domain (Anl90), the displayed GlbN domain was 
<30% holo when eluted off the Ni2+-NTA column, and lost almost all of the bound heme 
upon dialysis to remove imidazole. Using the phycocyanin (3 subunit as the carrier domain 

20 (An274), most of the displayed GlbN domain was purified with bound heme, and the heme 
remained bound to GlbN after dialysis. No degradation of the cyanoglobin domain was 
observed in either Anl90 or An274 in Anabaena cells, suggesting that the cyanoglobin 
domain displayed on the N 1 of phycocyanin a subunit was nrisfolded but remained resistant to 
proteolysis. The rnisfolded GlbN domain appears to interfere with the carrier domain's 

25 formation of phycocyanin trimers: on SEC-HPLC, about 80% of Anl90 proteins ran as 

trimers, (GblN-HT a:p)3, while the rest ran as monomers, GblN-HT a:p, in contrast to Anl68 
which runs only as trimers, (HT a: [3)3. Like An262, (<x:HTB)3, An274 ran only as trimers, 
(a:GlbN-HTB)3, on SEC-HPLC. 

In sum, both fusion proteins were quickly degraded in E. coli, but well displayed on 

30 phycobilisomes in Anabaena sp. Cyanoglobin displayed on the of phycocyanin B subunit 
seemed to fold better than on the N-terminus of the a subunit, and was able to retain larger 
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amount of heme than the latter during purification. No degradation of the cyanoglobin 
domain was observed in either construct in Anabaena cells, suggesting that the cyanoglobin 
domain displayed on the N-terminus of phycocyanin a subunit was misfolded but remained 
resistant to proteolysis. 

5 5. Sperm whale myoglobin fusion to phvcocvanin a or ft subunit. The myoglobin from 

sperm whale (SWMJJ) is similar to cyanoglobin as a monomeric protein with non-covalently 
bound heme. SWMB with a C-terminal extension of only six His residues (SWMp-6xHis; 
encoded by plasmid pBSl 14) can be well expressed as soluble holoprotein in both E. coli and 
Anabaena sp. Recombinant SWM(3 proteins with larger C-terminal extensions, however, 

10 appear to be unable to fold. SWMp-6xHis-TEV-LacZ a (encoded by plasmid pBS261), 

SWMB-6xHis-TEV-CpcA (encoded by plasmid pBS271), and SWMb-6xHis-TEV-CpcB 
(encoded by plasmid pBS3 IS) fusion proteins all gave poor yields when expressed in E. coli 
and were found nearly entirely in inclusion bodies. 

Anabaena cultures expressing either An271 (SWMB-HTa) or An361 (SWMJ3-HTB) 

1 5 . fusion proteins were healthy, and the fusion proteins were found assembled into 

phycobilisornes, and were purified with cognate partner phycocyanin subunits. Although 
myoglobin displayed on the N-terminus of phycocyanin JB subunit (SWMB-HTB) gave 
slightly better yield of the full length protein (with little amount of bound heme), both fusion 
proteins were purified mostly without the myoglobin moiety. This indicates that the displayed 

20 domain, if unable to fold, is degraded. 

6. Maltose-binding protein fMalE. MBP) fasion-to phvcocvanin a or R subunit. The 
"mature* 1 recombinant MalE protein (lacking the membrane-crossing signal peptide) with a C- 
terminal 6xHis tag is well expressed as a soluble protein in the cytosol of both Anabaena sp. 
and E. coli. In the MalE-phycocyanin fusion constructs, the last three residues (Arg-Ile-Thr) 

25 of the mature MalE are replaced by the spacer Asn-Ser-Ser, and this modified MalE is fused 
to the N-terminus of a phycocyanin subunit via a 24-residue linker that bears a 6xHis tag and 
a recognition and cleavage site for the TEV protease. The MalE-6xHis-TEV-dc (Ec396) and 
MalE-6xHis-TEV-B (Ec398) fusion proteins are both expressed as soluble proteins in E. coli 
in relatively high yield. 
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The strain Anabaena sp. PCC7120(pBS396) expressing fusion protein MalE-6xHis- 
TEV-a had a phenotype very similar to that of strain Anabaena sp. PCC7120(pBS168) 
expressing HTa. Sucrose density gradient sedimentation fractionation showed that the MalE- 
6xHis-TEV-a fusion protein was assembled into the phycobilisome without obvious effects 
on the macromolecular complex. Ni 2+ -NTA affinity-purified protein, An396, was in relatively 
good yield, accounting for >10% of total cellular phycobiliproteins. Over 95% of the An396 
proteins can bind to, and be specifically eluted from amylose columns, diagnostic of 
functional MalE. 

SDS-PAGE showed that the An396 protein has the composition MalE-6xHis-TEV- 
a:B, and Zn2+-induced fluorescence indicates full chromophorylation of phycocyanin carrier 
domain. SEC-HPLC showed that An396 protein was monomelic, (MalE-6xHis-TEV-a:fi) a 
behavior differing from that of HTcc:B, whose elution volume is that of a trimer. The MalE- 
6xHis-TEV-oc:B protein in the SEC-HPLC monomer peak fraction, however, has 
spectroscopic properties very similar to the (HTa:B)3 trimer, with absorbance maximum at 
617 run, fluorescence excitation maximum at 618 nm (for 650 nm emission), fluorescence 
emission max. at 637 nm (for 560 nm excitation), and a fluorescence quantum yield of 0.23. 

Upon digestion with the TEV protease, over 90% of purified MalE-6xHis-TEV-a:6 
protein can be clipped at the TEV recognition site to separate the displayed domain (MalE) 
from the phycocyanin carrier domain. 

The fusion protein MalE-6xHis-TEV-B is expressed in Anabaena sp. at a level (>7.5% 
of total cellular phycobiliproteins) comparable to that of MalE-6xHis-TEV-cc (An396). Over 
95% of Ni2+-NTA-purified An398 protein can bind to, and then be specifically eluted from, 
the amylose resin, indicative of the MalE function. However, analysis of An398 by SDS- 
PAGE shows that the protein preparation consists mostly of MalE-6xHis-TEV-B with only a 
very small amount of the partner phycocyanin a subunit. Zn2-Mnduced fluorescence 
suggests that the MalE-6xHis-TEV-B protein has a full complement of phycocyanobilin, i. e., 
two PCBs per B subunit domain. On SEC-HPLC most of the preparation runs as (MalE- 
6xHis-TEV-B)2 homodimer, with a slight shoulder corresponding to the (cc:MalE-6xHis- 
TEV-B) monomer. Like MalE-6xHis-TEV~a:B (An396), virtually all of the MalE-6xHis- 
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TEV-B fusion proteins can be cleaved at the TEV site to separate the displayed protein 
(MalE) and the carrier protein (phycocyanin B subunit). 

The MalE-6xHis-TEV-B protein is found to be assembled into phycobilisomes 
without obvious deleterious effect on the latter structures. Since assembly of the fusion 
protein into the phycobilisomes requires association with the phycocyanin a subunit, this 
result indicates that the MalE-6xHis-TEV-.fi fusion protein has a reduced affinity to the a 
subunit which is likely lost during the purification process. In the absence of a subunits, 
phycocyanin B subunits have been shown to form stable homodimers (see, e.g. Example A, 
supra). Such interference by the displayed domain on the phycocyanin subunit carrier 
domain's affinity for its cognate partner subunit is likely fusion protein specific - 
cyanoglobin displayed on the N-terminus of the phycocyanin B subunit does not appear to 
have obvious effects on the fusion protein's affinity for the phycocyanin a subunit. 

The (MalE-6xHis-TEV-B)2 homodimer peak fraction in SEC-HPLC was used for 
spectroscopic characterization. Similar to the (HTB)2 homodimer protein, the (MalE-6xHis- 
TEV-B)2 homodimer has absorbance and fluorescence excitation maxima at 605 nm, a 
fluorescence emission maximum at 638 nm, and a fluorescence quantum yield of 0.26. 

To obtain 1:1 a:B stoichiometry, the B-Lll-a subunit-fusion phycocyanin (Example 
B, supra) can be used, in place of the B subunit, as a carrier protein. Constructs incorporating 
the GCN4-pII trimerization domain, such as (MalE-6xHis-pII-a:B)3 and (MalE-6xHis-pII-B- 
Ll l-a)3, have excellent spectroscopic properties, and are very useful, for instance, in studies 
of protein glycosylation. 

All publications and patent applications cited in this specification and all references 
cited therein are herein incorporated by reference as if each individual publication or patent 
application or reference were specifically and individually indicated to be. incorporated by 
reference. Although the foregoing invention has been described in some detail by way of 
illustration and example for purposes of clarity of understanding, it will be readily apparent to 
those of ordinary skill in the art in light of the teachings of this invention that certain changes 
and modifications may be made thereto without departing from the spirit or scope of the 
appended claims. 
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WHAT IS CLAIMED IS: 

1 . A fusion protein comprising a functional displayed domain and a functional 
phycobiliprotein domain incorporated in a functional oligomeric phycobiliprotein. 

5 2. A fusion protein according to claim 1 wherein the phycobiliprotein domain is a natural 
phycobiliprotein domain. 

3. A fusion protein according to claim 1 wherein the functional oligomeric phycobiliprotein 
is an a,B heterodimer. 

10 

4. A fusion protein according to claim 1 wherein the displayed domain comprises at least 
one of an affinity tag, an oligomerization moiety, a specific binding moiety, and a signaling 
moiety. 

15 5. A fusion protein according to claim 1 further comprising an additional moiety selected 
from an affinity tag, an oligomerization moiety, a specific binding moiety, and a signaling 
moiety. 

6. A fusion protein according to claim 1 further comprising a specific binding moiety 

20 selected from a streptavidin biotin-binding moiety, a biotinylated or biotinylatable moiety, 
and an antigen binding immunoglobulin moiety. 

7. A fusion protein according to claim 1 further comprising a linker peptide between the 
displayed domain and the phycobiliprotein domain. 

25 

8. A fusion protein according to claim 1 further comprising a protease cleavage site between 
the displayed domain and the phycobiliprotein domain. 

9. A fusion protein according to claim 1 wherein the phycobiliprotein domain comprises at 
30 least one functionally attached bilin. 
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10. A fusion protein according to claim 1 wherein the displayed domain is refractive to 
expression in E. coli. 



1 1. A functional oligomeric phycobiliprotein comprising a fusion protein according to claim 
5 1. 

12. A functional phycobilisome comprising a fusion protein according to claim 1. 

13. A cell comprising a functional oligomeric phycobiliprotein according to claim 6. 

10 - 

14. A cell comprising a functional oligomeric phycobiliprotein according to claim 6, wherein 

the cell naturally expresses a phycobiliprotein. 

15. A method for making a functional displayed domain, the method comprising the step of 
15 combining a polypeptide comprising a displayed domain and a phycobiliprotein domain with 

a phycobiliprotein subunit under conditions to form a fusion protein according to claim 1. 

16. A method according to claim 15 further comprising, after the combining step, the step of 
separating the functional displayed domain from the functional phycobiliprotein domain. 

20 

17. A method according to claim 15 further comprising, prior to the combining step, the step 
of making the polypeptide by expressing a nucleic acid encoding the polypeptide. 

18. A method according to claim 15 further comprising, prior to the combining step, the step 
25 of making the polypeptide by expressing a nucleic acid encoding the polypeptide, wherein the 

making and combining steps occur in a cell. 

19. A method according to claim 15 further comprising, prior to the combining step, the step 
of making the polypeptide by expressing a nucleic acid encoding the polypeptide, wherein the 

30 making and combining steps occur in a cell, and the cell is or is a progeny of a cell which 
naturally expresses a phycobiliprotein. 
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20. A method according to claim 19, wherein the fusion protein further comprises a 
functionally attached bilin. 

21. A method according to claim 19, wherein the fusion protein further comprises a linker 
peptide between the displayed domain and the phycobiliprotein domain. 

22. A method according to claim 19, wherein the fusion protein further comprises a protease 
cleavage site between the displayed domain and the phycobiliprotein domain. 

23. A method according to claim 19, wherein the displayed domain is refractive to 
expression in E. coli. 
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